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Abstract 

The general MIMO interference network is considered and is named the B-MAC Network, for it is 
a combination of multiple interfering broadcast channels and multiaccess channels. Included as special 
cases are interference channels, X channels, X networks, and most practical wireless networks. The 
optimization of this important class of networks has been hindered by that beyond the single antenna 
q _ multiaccess channels, little is known about the optimal input structure, which is found here to be the 

polite water-filling, satisfied by all Pareto optimal input. This network version of water-filling is polite 
£>. ■ because it optimally balances between reducing interference to others and maximizing a link's own rate, 

OO 
<N 

, most network optimization problems. Deeply connected is the rate duality extended to the forward and 

reverse links of the B-MAC networks. As a demonstration, weighted sum-rate maximization algorithms 



offering a method to decompose a network into multiple equivalent single-user channels and thus, paving 
the way for designing/improving low-complexity centralized/distributed/game-theoretic algorithms for 



with superior performance and low complexity are designed for B-MAC networks and are analyzed 
for Interference Tree (iTree) Networks, a sub-class of the B-MAC networks that possesses promising 
properties for further information theoretic study. 

Index Terms 

Water-filling, Duality, MIMO, Interference Channel, One-hop Network, Transmitter Optimization, 
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I. Introduction 

A. System Setup 

The paper considers the optimization of general one-hop multiple-input multiple-output (MIMO) 
interference networks. Such a network can be represented by a bipartite graph with multi-antenna 
transmitter nodes on the left and multi-antenna receiver nodes on the right, with non-zero channels 
between them as edges. Each transmitter may send independent data to multiple receivers and 
each receiver may collect independent data from multiple transmitters. Consequently, the network 
is a combination of multiple interfering broadcast channels (BC) and multiaccess channels (MAC) 
and we name it the Broadcast-Multiaccess Channel (B-MAC), the B-MAC network, or the B- 
MAC Interference Network. It includes BC, MAC, interference channels Q-lEl, X channels flU, 
0, X networks 0, and most practical communication networks, such as (cooperative) cellular 
networks, wireless LAN, and digital subscriber line, as special cases. Therefore, optimization of 
such networks has both theoretical and practical impact. 

We assume Gaussian input and that each interference is either completely cancelled or treated 
as noise. A wide range of interference cancellation is allowed, from no cancellation to any 
cancellation specified by a binary coupling matrix of the data links, as long as each signal 
is decoded and possibly cancelled at no more than one receiver. For example, simple linear 
receivers, dirty paper coding (DPC) Q at transmitters, and/or successive interference cancellation 
(SIC) at receivers may be employed. For the more sophisticated Han-Kobayashi scheme where 
a common message is decoded at more than one receiver, the results in this paper may be 
generalized as discussed in Remark [7] of Section IIII-BI and in Section ED The reasons of 
considering the above setting are as follows. 1) The capacity regions and optimal transmission 
strategies of B-MAC networks are unknown except for the special cases of BC and MAC, where 
DPC and SIC are optimal. Even for the well studied MIMO BC and MAC, the optimal input 
structure is unknown in general, except for the water-filling structure for sum-rate optimal points 
1181- lfTOl . 2) Low complexity is desired in practical systems. The more sophisticated techniques 
such as Han-Kobayashi scheme fl2], [fTTI . Ifl"2ll may not be implementable if the transmission 
schemes of the interference are not available and thus, cannot be decoded. 3) Treating weak 
interference as noise is sometimes optimal [fT3l - [[T8l . 4) Limiting to the above setting enables us 
to make progress and gain insight into the problem. The results and insight serves as a stepping 
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stone to more sophisticated design. 

B. Single-user (Selfish) Water-filling 

The generally non-convex optimization of this important class of networks has been hindered 
by that beyond the full solution in the 1998 seminal work [[T9l on single antenna multiaccess 
channels, little is known about the structure of the Pareto optimal input of the achievable rate 
region. Finding the boundary points of a MAC/BC capacity region, can be done using modified 
general purpose convex programming, e.g., Il20ll . But it is desirable to exploit the structure of the 
problem to design more efficient and lower complexity algorithms that will work well for more 
general networks where the problems are likely non-convex. For the special case of sum-rate 
maximization in MIMO MAC, a water- filling structure is discovered in (8). The sum-rate can 
be combined into one logdet function as in the single-user case. Then, one user's optimal input 
has the single-user water-filling structure when other users' signals are considered as noise. For 
individual power constraints, it results in a simple and efficient iterative water- filling algorithm 
[H. Using duality, the approach is modified for sum-rate maximization for BC to take care of sum 
power constraint instead of individual power constraints O, IfTOl . The above approach works 
because the objective function happens to look like a single-user rate and thus, cannot be used for 
weighted sum-rate maximization, which depends on other Pareto optimal points. The approach in 
||9l is generalized for weighted sum-rate maximization with a single antenna at receivers II2TTI . Its 
generalized water-filling structure no longer has an exact water-filling interpretation and cannot 
be used for the cases with more than one receive antennas. 

Directly applying single-user water-filling to networks is referred to as selfish water-filling 
here. It is well known to be far from optimal [|22 | -[[24 | . The reason is that selfish water- filling 
only considers the interference from others, without considering the interference to others, when 
maximizing a link's own rate. Based on selfish water-filling, the game-theoretic, distributed, and 
iterative algorithms have been well studied for the digital subscriber line [|22|. [|25l - [|29l . which 
has parallel SISO (single-input single-output) interference channels, for MIMO interference 
channels, e.g., Il30l - ll33l . and for multiaccess channels, e.g., Il24l . The uniqueness of the Nash 
equilibrium is established and the algorithms converge only under stringent conditions. Even in 
the case of convergence, the performance is rarely optimal. 

The importance of controlling interference to others has been recognized in literature, e.g., 
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Il34l - ll371 . But a systematic, general, and optimal method has not been found. In ||34ll , 113511 , 
11371 . SISO (parallel) interference channels is considered. The idea in ll34l is to minimize each 
link's individual interference to the single most vulnerable user subject to this link's minimum 
rate requirement. In [[3711 , it is proposed to levy a tax on each user's power. The tax rate of a 
user is the negative of the derivative of other users' weighted sum-rate over the user's power. 
In 11351 Section IV], after directly solving the Karush-Kuhn-Tucker (KKT) conditions, a term 
tk(n), which is the same as the previous tax rate, appears in the solution and affects the water- 
filling level. In ll3~6ll . it is recognized that the traditional water- filling is selfish and in a MEMO 
interference channel, the direction causing less interference to others is the same as the direction 
receiving less interference from others in the reverse link. 
Consequently, the following problems have been open. 

• In single-user MIMO channels or parallel channels, no one uses general purpose convex 
programing for transmitter optimization because we know the optimal input structure is 
water-filling and thus, we only need to optimize the parameters of the water- filling. Why 
haven't we been able to do the same for networks? 

• What is the optimal input structure of all boundary points of the MIMO MAC/BC capacity 
region? 

• Is it possible that there exists some kind of water-filling that is different from the single- 
user water-filling and is optimal for all boundary points of the MAC capacity region? If so, 
it can be used to design low-complexity high-performance algorithms for all optimization 
problems related to the boundary points. General purpose convex programing is no longer 
necessary. 

• Even more ambitious questions are for the B-MAC networks. These general interference 
networks are much less understood and the related optimization problems are generally 
non-convex. What is the optimal input structure of all the boundary points of the achiev- 
able region? What is the optimal method to control interference to others while avoiding 
interference from others in a network? Can we decompose a network to multiple equivalent 
single-user channels so that the (distributed) optimization can be made easy? Does there exist 
a network version of water-filling that is optimal? Can we use the optimal input structure to 
design low complexity algorithms for all related optimization problems, including those for 
cellular networks, multi-cell cooperative networks, cognitive radios, and digital subscriber 
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lines, so that it is hard not to find near optimal solutions even in non-convex cases? 
The main contribution of this paper is an optimal network version of water-filling for the above 
problems. 

Related method to water-filling is zero forcing, which has good performance at high signal- 
to-noise power ratio (SNR). Spatial interference alignment ||38l can be viewed as the network 
counter part of zero-forcing in broadcast channels and can be found by transmitter optimization 
algorithms. In ll36ll . the duality based alternating optimization is employed to design distributed 
interference alignment algorithms. However, interference alignment is usually suboptimal at 
moderate or low SNR. Better algorithms solving the problem of how to choose the right number 
of data streams for each link is given in 11391 . Il40ll . In flU, interference alignment/zero-forcing 
is employed to maximize the multiplexing gain of the X channel, which is a combination of 
two-user MAC and two-user BC. However, the iterative schemes in [H are specialized for the 
two user X channel and generalizations to general networks are not straightforward. The new 
water-filling in this paper automatically results in near optimal zero forcing transmission at high 
SNR and may be modified to conduct zero forcing directly for general interference networks. 

C. SINR and Rate Duality 

In this paper, we extend the MAC-BC rate duality to the forward and reverse links of the 
B-MAC networks. The duality has a deep connection to the new water-filling structure and can 
be used to design low complexity iterative algorithms. The extension is a simple consequence of 
the signal-to-interference-plus-noise power ratio (SINR) duality. The SINR duality states that if 
a set of SINRs is achievable in the forward links, then the same SINRs can be achieved in the 
reverse links when the set of transmit and receive beamforming vectors are fixed and the roles 
of transmit and receive beams are exchanged in the reverse links. Thus, optimizing the transmit 
vectors of the forward links is equivalent to optimizing the receive vectors in the reverse links, 
which can reduce complexity because receive vector optimization is simpler. The SINR duality 
between single-input multiple-output (SIMO) MAC and multiple-input single-output (MISO) BC 
was first found in [|4T| and further developed in ||42l|. Il43ll . Alternating optimization based on 
it has been employed in B2l . [|44ll - [|49l . Significant insight and progress was achieved in Il50ll . 
where the SINR duality is extended to any one-hop MIMO networks with linear beamformers 
by the Lagrange duality of linear programming. 
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The MAC-BC rate duality has been established in ||5Ti - ||54l . The forward link BC and reverse 
link MAC have the same capacity region and the dual input covariance matrices can be calculated 
by different transformations. In |[52l , the dual input covariance transformation is derived based 
on the flipped channel and achieves the same rates as the forward links with equal or less power. 
The transformation cannot be calculated sequentially in general B-MAC networks, unless there is 
no loops in the interference graphs as discussed later. The MAC-BC duality can also be derived 
from the SINR duality as shown in [|53l . where a different dual covariance transformation is 
calculated from the MMSE receive beams and power allocation that makes the SINRs of the 
forward and reverse links equal. Such a transformation achieves equal or higher rates than 
the forward links under the same sum power. Because the transformation does not need to be 
calculated sequentially, it can be easily generalized to B-MAC networks as followed in this paper. 
Furthermore, the precoding matrices in the SINR duality based transformation can be designed 
such that there is no interference among the streams of a link ll55l . This orthogonal stream 
technique is directly applicable to B-MAC networks. The above MAC-BC duality assumes sum 
power constraint. It can be generalized to a linear constraint using minimax duality and SINR 
duality ll56l . 11571 . Efficient interior point methods have been applied to solve optimizations with 
multiple linear constraints in Il58ll . ||59ll , where the advantage is that the optimal primary variables 
and Lagrange multipliers are searched jointly. It is expected that exploiting optimal input structure 
will produce better algorithms that also work for more general networks, as discussed later. 

D. Contributions 

The following is an overview of the contributions of this paper. 

• Duality: As a simple consequence of the SINR duality, we show that the forward and 
reverse links of B-MAC networks have the same achievable rate region in Section IIII-AI 
The dual input covariance matrices are obtained from a transformation based on the MMSE 
filtering and the SINR duality. Different from the input covariance transformation in ll52l . 
which achieves the same rate as the forward links but with equal or less power and cannot 
be calculated sequentially for general B-MAC networks, the transformation in this paper 
achieves equal or larger rates with the same power and can be calculated easily for general 
B-MAC networks. We show that the two transformation coincide at the Pareto rate points. 
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• Polite Water-filling: The long sought-after network version of water-filling for all the Pareto 
optimal input of the achievable rate region of the B-MAC networks is found in Section Ull-B I 
Different from the traditional selfish water-filling, the polite water-filling optimally balances 
between reducing interference to others and maximizing a link's own rate in a beautifully 
symmetric form, while the traditional selfish water-filling only considers interference from 
others. Conceptually, it tells us that the optimal method to control the interference to others 
is through optimal pre-whitening of the channel of a link. It offers an elegant method to 
decompose a network into multiple equivalent single-user channels and thus, paves the way 
for designing/improving low-complexity centralized or distributed/game-theoretic algorithms 
for almost all network optimization problems, e.g., those in El, ESJ-Ell, ED, ED, ll46ll . 
Il60l . In fact, when imposing the optimal structure at every iteration of an algorithm, intuition 
and empirical results show that it is hard not to obtain a good solution even in non- 
convex cases. Consequently, general purpose convex or non-convex programing is no longer 
necessary. It also provides hints on the optimization of Han-Kobayashi transmission scheme 
0. The polite water- filling has the following structure. Consider link I with channel matrix 
Hi j and Pareto optimal input covariance matrix The equivalent input covariance matrix 
Qi = fij Xjfij is found to be the water- filling of the pre- and post- whitened equivalent 

— 1/2 ~ 1/2 

channel = f2 ; ' H^fij ' , where fi; is the interference-plus-noise covariance of the 
forward link, used to avoid interference from others. The physical meaning of fli, used to 
control interference to others, is two folded. In terms of the reverse link, it is the interference- 
plus-noise covariance of the reverse link, resulted from the optimal dual input. In terms of 
the forward link, it is the Lagrangian penalty for causing interference to other links. The 
deep connection to duality is that the optimal Lagrange multipliers work out beautifully to 
be the optimal dual reverse link powers. A surprising property is that the forward and reverse 
link equivalent input powers satisfy Tr(Q^) = Tr (Qi^j f° r a U individual links under total 
or other power constraints. This property will be useful for designing distributed algorithms. 
For the sum-rate optimal input, the polite water-filling does not reduce to the water-filling 
in j8]| because the polite water-filling is related to each individual link. 

• Extension to a Linear Constraint: We show that all results in the paper, including duality, 
polite water-filling structure, and algorithms, can be generalized from the case of a sum 
power constraint and white noise to the case of a linear constraint on all links' input 
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covariance matrices and colored noise in Section IIII-C1 The dual of the linear constraint 
in the forward link is the colored noise in the reverse link, the same as in multiaccess 
and broadcast channels Il56l , [1571 . The linear constraint result can be used as basis to 
handle multiple linear constraints, which arise in individual power constraints, per-antenna 
power constraints, and interference reduction to primary users in cognitive radios |[56ll . and 
therefore, is useful for many practical applications. More details on the extension of the 
algorithms will be discussed in Section |VU on future work. 

• Weighted Sum-Rate Maximization: The polite water-filling can be applied to most existing 
optimization problems of B-MAC networks. In Section|IV] the weighted sum-rate maximiza- 
tion is used as an example to illustrate its superiority. The lowest complexity per iteration 
of the designed algorithms is linear with respect to the total number of data links. They 
have superior accuracy and convergence speed, which, unlike generic convex programming, 
does not depend on the total number of data links. This is a result of enforcing optimal 
input structure by polite water-filling, where the interference is summarized by fli and fti 
no matter how many interfering links are there. 

• iTree Networks: The optimization for B-MAC networks is not convex in general. To an- 
alyze the algorithms and to provide a monotonically converging algorithm, we introduce 
a narrower class of networks, named Interference Tree (iTree) Networks in Section IIV-B1 
which seems worth studying in its own right. After interference cancellation, the remaining 
interference among links can be represented by a directional graph. If there is no directional 
loop in the graph, the network is called an iTree network, where one can find an indexing 
such that any link is not interfered by the links with smaller indices. It appears to be a logical 
extension of MAC and BC. An approach to making progress in network information theory 
is to study special cases such as deterministic channels ifTTI . ll6TTl and degree of freedom 
ED, [fTT|. Il62l . iTree networks looks promising in this sense. 

The rest of the paper is organized as follows. Section HI] defines the achievable rate region and 
summarizes the preliminaries. Section [III] presents the theoretical results on the duality and polite 
water- filling. As an example, polite water-filling is applied to weighted sum-rate maximization in 
Section [TV] where iTree networks is introduced for optimality analysis. The performance of the 
algorithms is verified by simulation in Section |V] The results of this paper provides a stepping 
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stone for solving more general problems, such as the optimization of Han-Kobayashi transmission 
scheme and cognitive radios, distributed optimization, and analysis of iTree Networks. They are 
discussed in Section |VH along with the conclusions. 



II. System Model and Preliminaries 

We first define the achievable rate region studied in the paper. It is followed by a summary 
of the SINR duality in It50ll . which will be useful for the derivation in the next Section. We use 
||-|| for L 2 norm and || - Jj x for L x norm. The complex gradient of a real function will be used 
extensively in the paper and is defined as follows. Define /(Z) : C MxN — > R. The extension of 
the results in Il63ll gives the gradient of /(Z) over Z as 

'df(Z 



V z / 



dZ 



where e C MxN is defined as 



df(Z) 



dZ 



df 
dziA 



df 



dZMA 



df 
dzi, n 



df 

dz M ,i 



and Zij jyi,]i q z 



df_ 



1 df 

2 dxi , 



3 df 

2 dyi,j 



Vi, j. If Z is Hermitian, it can be proved that the 



above formula can be used without change by treating the entries in Z as independent variables. 



A. Definition of the Achievable Rate Region 

We consider the most general one-hop MIMO interference network with multiple transmitters 
and receivers, which is a combination of interfering Broadcast and Multiaccess Channels. We 
call it a B-MAC network where each physical transmitter may have data for multiple receivers 
and each physical receivers may want data from multiple transmitters. There are L transmission 
links. Let TJ and Ri denote the virtual transmitter and receiver of link / equipped with L Tl and 
L Rl antennas respectively. The received signal at Ri can be expressed as 

L 

yi = ^H^.x fc + w ; , (l) 

k=l 

where x k £ C LT fc xl is the transmit signal of link k and is assumed to be circularly symmetric 
complex Gaussian; H^- e l C LR i y - LT k is the channel state matrix between T k and Ru and wj e 
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C Lr i x1 is a circularly symmetric complex Gaussian noise vector with zero mean and identity 
covariance matrix. 

To handle a wide range of interference cancellation, we define a coupling matrix $ £ M^ xL as 
a function of the interference cancellation scheme. It specifies whether interference is completely 
cancelled or treated as noise: if x fe , after interference cancellation, still causes interference to xj, 
<&i t k = 1 and otherwise, = 0. For example, when no interference cancellation technique is 
used, $ is a matrix of l's with zero diagonal entries. If the transmitters (receivers) of several links 
are associated with the same physical transmitter (receiver), interference cancellation techniques 
such as dirty paper coding (successive decoding and cancellation) can be applied at this physical 
transmitter (receiver) to improve the performance. 

Remark 1: The coupling matrices valid for the results of this paper are those for which 
there exists a transmission and receiving scheme such that each signal is decoded and possibly 
cancelled by no more than one receiver, because in the achievable rate region defined later, a rate 
is determined by one equivalent channel. In the Han-Kobayashi scheme, a common message is 
decoded by more than one receiver. Future extension to the Han-Kobayashi scheme is discussed 
in Remark U\ 

We give some examples of valid coupling matrices. For a BC (MAC) employing DPC (SIC) 
where the I th link is the I th one to be encoded (decoded), the coupling matrix is given by 
&i,k — 0, Vfc < I and = 1, VA; > I. In Fig. [Q we give an example of a B-MAC network 
employing DPC and SIC. Both T 2 and T 3 (i?x and R 2 ) are associated with the same physical 
transmitter (receiver). The following 3> a , <fr c , & d are valid coupling matrices for link 1, 2, 3 
under the corresponding encoding and decoding orders: a. x 3 is encoded after x 2 and x 2 is 
decoded after x x ; b. x 2 is encoded after x 3 and x 2 is decoded after x x ; c. x 3 is encoded after 
x 2 and x x is decoded after x 2 ; d. There is no interference cancellation. 
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Transmitters Receivers 




• I Interfering link ► 

I • 

Figure 1. Example of a B-MAC network 

Remark 2: When DPC and SIC are combined, it is possible that an interference may not be 
fully cancelled under a specific encoding and decoding order. For example, in Fig. [TJ if x 2 is 
encoded after x 3 and x 1 is decoded after x 2 , the interference from x 2 and x 3 to x 1 may not be 
fully cancelled JH, ||64l . Such case cannot be described by the coupling matrix of 0's and l's 
defined above. However, one may employ the results in this paper to obtain the inner and outer 
bounds of rates using pessimistic or optimistic coupling matrices. Another possible solution is to 
decompose the network into sub-networks, such as the methods proposed in flU, [|64ll where the 
two-user X channel is decomposed into two BCs or two MACs using zero-forcing and alternating 
optimization. Then each sub-network has a valid coupling matrix and the results in this paper 
can be applied. It will be an interesting future research to design more sophisticated coding 
techniques to fully cancel the interference from x 2 and x 3 to xi. 

Remark 3: The assumption of either full or no cancellation of each interference can be relaxed. 
All results in the paper still hold when the interference from link k to link / is partially cancelled, 
as long as the covariance matrix of the remaining interference has the form of H^Efcfijj., where 
Hz^ e C LR i xLT k can be any constant matrix. This is because H i fc can be treated as the equivalent 
channel from link k to link I. 

For the above B-MAC network, we define an achievable rate region. Unless explicitly specified, 
all achievable regions in this paper refer to the following one. Note that = by definition. 
The interference-plus-noise of the I th link is J2k=i ^i.k^-i.k^k + w h whose covariance matrix is 

L 

n, = I+^* {ifc H {ifc S fc Hf ifc , (2) 

k=l 

where S fc is the covariance matrix of x fc . We denote all the covariance matrices as 

S 1:L = (S 1; S 2 ,...,S L ). (3) 
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Then the mutual information (rate) of link I is given by a function of S 1:L and <fr 



X ; (S 1:L ,$) = log i + h^hJ/j; 



-1 



(4) 



Definition 1: The Achievable Rate Region with a fixed coupling matrix $ and sum power 
constraint Pt is defined as 



Assuming the sum power constraint is the necessary first step for more complicated cases. It 
has its own theoretical value and the result can be easily used for individual power constraints 
and the more general multiple linear constraints as discussed in Section IIII-CI and [VTl 

Definition 2: If [Xj (Si ; l, $)] z=1 . l is a Pareto rate point of 1Z& (Pt), the input covariance 
matrices E 1:i are said to be Pareto optimal. 

A bigger achievable rate region can be defined by the convex closure of U#es ^* (Pt), where 
H is a set of valid coupling matrices. For example, if DPC and/or SIC are employed, S can be a 
set of valid coupling matrices corresponding to various encoding and/or decoding orders. In this 
paper, we focus on a fixed coupling matrix $ and fixed channel matrices. The optimal coupling 
matrix or equivalently, the optimal encoding and/or decoding order of the weighted sum-rate 
maximization problem is partially characterized in Section IIV-AI It is straightforward to extend 
the results in this paper to random channel cases like fading channels. The reason to consider 
sum power constraints is given in Section IIII-CI 

The above definition ensures the following. 

Theorem 1: The boundary points of the region 1Z$, (Pt) are Pareto optimal. That is for each 
boundary point r, there does not exist another r' G 1Z& (Pt) satisfying r' > r and 3k, s.t. r' k > 
r k , i.e., all boundary rate points are strong Pareto optimums. 

The proof is given in Appendix [Al 

In Section Unl we will characterize the Pareto optimal input covariance matrices based on a 
rate duality between the achievable rate regions of the forward and reverse links. The reverse 
links are obtained by reversing the directions of the forward links and replacing the channel 
matrices by their conjugate transposes. The coupling matrix for the reverse links is the transpose 



A 





(5) 



^l:L--J2tl T ^l)< P T 

n < Xi (Si :ij *) , l < i<L}. 
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of that for the forward links. We use the notation " to denote the corresponding terms in the 
reverse links. For example, in the reverse links of the B-MAC network in Fig. [U T 2 /T 3 (Ri/R 2 ) 
becomes the receiver (transmitter), and x 2 is decoded after x 3 and Xi is encoded after x 2 , if in 
the forward links, x 3 is encoded after x 2 and x 2 is decoded after xi. The interference-plus-noise 
covariance matrix of reverse link I is 



I + *fc,iHl,i£* H *,i 



k=l 



And the rate of reverse link / is given by 



(6) 



I| Sl:I,* J 



log 



I + Hj^H^- 1 



For a fixed the achievable rate region for the reverse links is defined as: 



TZ^t(P t ) 4 |J {fel^: (7) 

fi < % (± 1:L , $ T ) , 1 < / < l} . 

B. SINR Duality for MIMO B-MAC Networks 

The above achievable rate region can be achieved by a spatial multiplexing scheme with 
successive interference cancellation (SIC) using minimum mean square error (MMSE) filtering 
and decoding. 

Definition 3: The Decomposition of a MIMO Link to Multiple SISO Data Streams is defined 
as, for link / and Mi > Rank(S ; ), finding a precoding matrix = [y/pijt^i, y/pijafiiMi] 
satisfying 

Tii = T;T| = y~]pi,mbl,mbt m , (8) 

m=l 

where t^. m e C iT ' xl is a transmit vector with ||tj )TO || = 1; and {pi tm } are the transmit powers. 
We also define p = \pi,i, ■■■,Pi,Mi, ••■■>Pl,i, ■■■,Pl,Ml\ T t0 be the transmit power vector of all 
links. 
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Note that the precoding matrix is not unique because T[ = T^V with unitary V G C MtxMl 
also gives the same covariance matrix in ([8]). Without loss of generality, we assume the intra- 
signal decoding order is that the m th stream is the m th to be decoded and cancelled. The receive 
vector r/ m G C Lr i x1 for the m th stream of link I is obtained by the MMSE filtering as 



Mi 



-1 



himaAMi + Q i H ^ 



(9) 



j=m+l 



where a^ m is chosen such that ||r; m || = 1. This is referred to as MMSE-SIC (MMSE Successive 
Interference Cancellation) receiver in this paper. 

For each stream, one can calculate its SINR and rate. For convenience, define the collections 
of transmit and receive vectors as 



T 
R 



m Jm=l,...,M;,«=l,...,L ' 
r l,m] m= i M i = i L ■ 



(10) 

(11) 



The cross-talk matrix ^ (T, R) G R+' M[X ^ 1 Ml between different streams ll46l is a function 
of T, R, and, assuming unit transmit power, the element of the ^X^Ii % + rri^j row and 

( k—l \ th 

( J2i=i Mi + n) column of St is the interference power from the k th link's n th stream to the 
I th link's m th stream and is given by 



k,n 





$>Lk 



r l,rJll,ktk,n 



k = I and m > n, 
k — l, and m < n, 
otherwise. 



(12) 



Then the SINR and the rate for the m th stream of link / can be expressed as a function of T, 
R and p, 



7 z, m (T,R,p) 



Pl,m \ r Lm 



r /.. m .H W t; : 



(T,R,p) 



L M k 
fc=ln=l 

log(l + 7 , m (T,R,p)) 



(13) 



(14) 



Such decomposition of data to streams with MMSE-SIC receiver is information lossless due 
to the following fact. 
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Fact 1: The mutual information in © is achieved by the MMSE-SIC receiver [65], i.e., it is 
equal to the sum-rate of all streams of link /: 

Mi 

rl = $>, m (T,R,p) (15) 

m=l 

= Z,(£ 1:L ,$). 

In the reverse links, we can obtain SINRs using R as transmit vectors and T as receive 
vectors. The transmit powers is denoted as q = [q^i, g^Mi, —,Ql,i, Ql,m l ] 7 '• The decoding 
order of the streams within a link is the opposite to that of the forward link, i.e., the m th stream 
is the m th last to be decoded and cancelled. Then the SINR for the m th stream of reverse link 
I can be expressed as 



% m (R, T, q) 



L M k 
fc=ln=l 



(16) 



For simplicity, we will use {T, R, p} ({R, T, q}) to denote the transmission and reception 
strategy described above in the forward (reverse) links. 

The achievable SINR regions of the forward and reverse links are the same. Define the 
achievable SINR regions 7* (Pt) and 7$t (P t ) as the set of all SINRs that can be achieved 
under the sum power constraint Pt in the forward and reverse links respectively. For given set of 
SINR values 7 ° = \j?J , „ , , r , define a diagonal matrix D (T, R, 7 °) e MlXj2 ' Ml 

i l rn=l,...,Mi,l=l,...,L ° ' ' y t 

being a function of T, R and 7 , where the ( J2i=i Mi + m) diagonal element is given by 



(17) 



The SINR duality in [|50l is restated in the following lemma. 

Lemma 1: If a set of SINRs 7 is achieved by the transmission and reception strategy {T, R, p} 
with ||p || j = P T in the forward links, then 7 is also achievable in the reverse links with 
{R, T, q}, where the transmit power vector q satisfies || q|| x = Pt and is given by 

q = (D-^T.R^-tf^T.R))" 1 !. (18) 

And thus, one has 7i (Pt) = T&t (Pt). 
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III. Theory 



In this section, we first establish a rate duality, which is a simple consequence of the SINR 
duality in ||50ll . Then based on the rate duality, the polite water-filling structure and properties of 
the Pareto optimal input are characterized. Finally, we discuss the extension from a sum power 
constraint to a linear constraint and the extension from white noise to colored noise. 

A. A Rate Duality for B-MAC Networks 

The rate duality is a simple consequence of the SINR duality. 

Theorem 2: The achievable rate regions of the forward and reverse links of a B-MAC network 
defined in © and © respectively are the same, i.e., (P T ) = TZ^t (Pt)- 

Proof: For any rate point r in the region 1Z<s> (Pt) achieved by the input covariance matrices 
S 1;i , the covariance matrix transformation defined below can be used to obtain S 1:L for the 
reverse links such that a reverse link rate point f > r under the same sum power constraint Pt 
can be achieved, according to Lemma [2l The same is true for the reverse links. Therefore, we 



Definition 4: Let S ; = 2~2 m Li Vi,mXi,mX} m , I = 1, be a decomposition of E i: £. Compute 
the MMSE-SIC receive vectors R from © and the transmit powers q in the reverse links from 
CDS). The Covariance Transformation from S 1:L to Si : jr is defined as 



And Si : L is called the dual input covariance matrices of Ei : £. 

Lemma 2: For any input covariance matrices S 1:L satisfying the sum power constraint P T 
and achieving a rate point r £ (Pt), the corresponding dual input covariance matrices Si :i 
achieves a rate point f > r in the reverse links under the same sum power constraint. 

Proof: According to fact[TJ r is achieved by the transmission and reception strategy {T, R, p} 
with HpIIj = ^ ;=1 Tr(Si). It follows from Lemma[[]that {R, T, q} with || q|| x = X^iTr(E;) 
can also achieve the same rate point r in the reverse links. Note that T may not be the MMSE- 
SIC receive vectors, which is optimal for the reverse links. Therefore, the reverse link rates may 
be improved with a better receiver to obtain ? > r. ■ 

The following corollary follows immediately from Lemma [2] and Theorem |2j 



have TZ^ (P T ) = TZ^t (P T ). 



Mi 




(19) 



m=l 
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Corollary 1: For any input covariance matrices S 1:L achieving a Pareto rate point, the corre- 
sponding dual input covariance matrices Xi :L achieves the same Pareto rate point in the reverse 
links. 

The following makes connection of the covariance transformation (TT9b to the existing ones. 
First, it is essentially the same as the MAC-BC transformations in ll56l and 11551 which are also 
based on SINR duality. The only difference is that we do not restrict the choice of the precoding 
matrix T; in ([8]). In 11561 . the precoding matrix T/ is chosen as the eigenvectors of £/. And in 
11551 . the precoding matrix T; is chosen to decorrelate each link I such that the streams of link 
/ are orthogonal to each other. Second, we show that the MAC-BC transformation in l!52l is 
equivalent to the covariance transformation in (fT9l) at the Pareto rate point. 

Theorem 3: For any input covariance matrices S 1:L achieving a Pareto rate point, the dual 
input covariance matrices Ei : £ produced by the covariance transformation (fT9l) are the same as 
that produced by the MAC-BC transformation in 11521 . i.e., 

S ; = fi- 1/2 FiGjri; /2 S^; /2 G ; Fjfi- 1/2 , I = 1, L, (20) 

where G C Lr ' xNl , G; G C Lt ' xNi are obtained by the thin singular value decomposition 
(SVD): fi~ 1/2 H M ri~ 1/2 = FjAjGj; and N t is the rank of H M . 

Proof: The proof relies on Theorem [5] in Section IIII-BL which states that at the Pareto 
rate point, both S 1:i and its dual input covariance matrices S 1:i satisfy the polite water-filling 
structure as in (T29l) and (l30l) respectively. Then we have 

n) /2 %n] 12 = f^f; 

= F,GfG,D,GfG,F; 

where the first and last equations follow from (|29b and (|3Q|) respectively. ■ 
Remark 4: However, at the inner point of the rate region, the above two transformations 
are different. The transformation in (fl9l) keeps the sum transmit power unchanged while not 
decreasing the achievable rate, while the MAC-BC transformation in ||52l keeps the achievable 
rate the same while not increasing the sum transmit power. Furthermore, the extension of MAC- 
BC transformation in [|52l to B-MAC networks can only be sequentially calculated for iTree 
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networks defined in Section |IV-B[ a sub-class of B-MAC networks whose interference graph has 
no loops. 

B. Characterization of the Pareto Optimal Input 

In the following, we show that the Pareto optimal input covariance matrices have a polite 
water-filling structure. It generalizes the well known water- filling solution to networks. For 
single-user channel, the polite water-filling structure reduces to the conventional water-filling 
solution. First, we formally define the term polite water-filling structure. 

Definition 5: Given input covariance matrices obtain the dual input covariance matrices 
Si ; L by the covariance transformation in (fT9l) . Let fVs and fVs respectively be the corresponding 
interference-plus-noise covariance matrices in the forward and reverse links. For each link I, 
pre- and post- whiten the channel to produce an equivalent single-user channel = 

1/2 '*' 1/2 /\ I/2 *** 1/2 

H^f2j . Define Qi = S;f2/ as the equivalent input covariance matrix of the link 
/. The input covariance matrix is said to possess a polite water-filling structure if Q; satisfies 
the structure of water-filling over H^, i.e., 

Qt = G^G], (21) 

where v\ > is the water-filling level; the equivalent channel of link I is decomposed using the 
thin SVD as H, = FjAjGj, where F; e C Lr ' xNi , Gi e C Lt i xNi , A t e R+ l * Nl ; and N t is the 
rank of Hj^. If all Ej's possess the polite water-filling structure, then S 1:i is said to possess the 
polite water-filling structure. 

For B-MAC, the polite water-filling structure of the Pareto optimal input can be proved by 
observing that the optimal input covariance matrix for each link is the solution of some single-user 
optimization problem. We use the notation ~ for the optimal variables corresponding to a Pareto 
rate point. Without loss of generality, assume that = J2m=iPi,rnXi,rnX} m J = !>•••, L, where 



Pi tm > 0, V/,m, achieves a Pareto rate point 



Ti >0 



^ l and £ t = YJ£=i Qi,mri, m rl m , I 



1, L axe the corresponding dual input covariance matrices. For the Pareto rate point where the 
rates of some links are zero, the proof also holds by simply deleting these links and applying 
the proof for the resulting smaller network. The corresponding interference-plus-noise covariance 
matrices are denoted as Qi and fli respectively. Then it can be proved by contradiction that the 
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Pareto optimal input covariance matrix is the solution of the following single-user optimization 
problem for link /, where the transmission and reception schemes of other links are fixed as the 
Pareto optimal scheme, {p k m , t k m , f fc m , k ^ /}. 



minTr 



s.t. 



log 



m 



I + H^SjH^n," 
Tr ( S,A« m ) < Tr (E,Aj^ 
l,...,M k ,k = l,...,L,k^l, 



(0 



(22) 



(23) 



where A^ m = $fe,jH^ ; r fcim ?£ m H fc ^; and Tr ^SjA^ m J is the interference from link / to the m th 
stream of link k and is constrained not to exceed the optimal value Tr ( £j Aj? ) . The constraints 



force the rates to be the Pareto point while the power of link / is minimized. The Lagrangian 
of problem (l22l) is 



Li (A, v h 0, Ej) 

-Tr (E, (A, (A) - ©)) - E E A <^ Tr (^A^) 



k^l m=l 



where the dual variables v\ G 



and A 



[A 



k,m\ 



(24) 



lm=l,...,M k ,k^l ^ ^+ 

with the rate constraint and the interference constraints in (|23l respectively; Aj (A) = I + 
Ylk^i Sm=i Afc, m A^ m is a function of A; is the matrix dual variables associated with the 



positive semidefiniteness constraint on Ej. Note that problem (|22l) is convex, and thus, the 
duality gap is zero ||66l and E ; minimizes the Lagrangian (1241) with optimal dual variables 



and A 



A fc 



k,m 



m=l,...,M k ,k^l 



i.e., it is the solution of 



min Tr E Z A, ( A 



Hog 



I + HyEjHfjn,- 1 



(25) 



Note that in the objective function of problem (1231 ), the constant terms in the Lagrangian 
(I2"4l) have been deleted for simplicity of notations, and the term associated with the positive 
semidefiniteness constraint Tr(E z 0) is explicitly handled by adding the constraint E z y 0. 



The following theorem states that the physical meaning of the optimal dual variables 
is exactly the optimal power allocation in the reverse links. 



A 



k,m 



k^l 
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Theorem 4: The optimal dual variables of problem (1221) are given by 



Afc, m = q k , m ,m = 1, ...,M k ,Vk ^ I (26) 

2 

qi,mPl,m (1+7; 



7/ m 



-,Vm (27) 



where 7z im is the SINR of the m th stream of forward link / achieved by {pk,m, tfc,m, *k,m}- 
Therefore, 

A, (X) = A ; ([&,„»] = hi. 
The proof is given in Appendix [Bj 

The polite water-filling structure is shown by a single-user-channel view using the above 

~~ ~ — 1/2 A —1/2 A 1/2 A 1/2 

results. Let = ' H; ' and Q ; = f2- S{f2j respectively be the equivalent channel 
and the equivalent input covariance matrix for link I. Since tli is non-singular, problem (1251) is 
equivalent to a single- user optimization problem 

min Tr (Q,) - z>dog I + H,Q,fif , (28) 

of which Cl l J 2 'TiiCl l J 2 is an optimal solution. Since the optimal solution to problem (1281) is unique 
and is given by the water- filling over with u t as the water- filling level 11671 . the following 
theorem is proved. 

Theorem 5: For each I, perform the thin SVD as H, = F z A,Gf, where Fi £ C Lr i xN \ Gi £ 
C Lt i xNi , A t £ R^^, and N t is the rank of H^. At a Pareto rate point, the input covariance 
matrix S; must have a polite water-filling structure, i.e., the equivalent input covariance matrix 
Qi = f2j 2 53{f2j satisfies 

= GAGj, (29) 

D, = (^I-A r 2 ) + . 

Similarly, the corresponding S z produces Q; = J7^ 2 £zf2^ 2 , which satisfies 

4 = PjDjFj. (30) 

Remark 5: The insight given by the proof of Theorem [5] is that restricting interference to 
other links can be achieved by pre-whitening the channel with reverse link interference-plus- 
noise covariance matrix. And thus, the mutual interfering B-MAC can be converted to virtually 
independent equivalent channels H;, I — 1, L. 
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Remark 6: The restriction of interference to others is achieved in two steps. First, in = 

~ 1/2 A 1/2 A 1/2 

fi ; ' Hj jflj , the multiplication of fij 7 reduces the channel gain in the interfering directions 

~ ~"^_]/2**'^ 1/2 

so that in Q^, less power will be filled in these directions. Second, in = ' Q,iQ [ , the 

~ —1/2 

power to the interfering directions is further reduced by the multiplication of fl, l . 

Remark 7: The Lagrangian interpretation of Cli makes it possible to extend the duality and 
polite water-filling to Han-Kobayashi transmission scheme. Cancelling the interference requires 
the interference power to be greater than a threshold rather than less than it. Therefore, some 
Lagrange multipliers are negative in A; (A). If we still interpret the Lagrange multiplier as reverse 
link power, we must introduce the concept of negative power. With the new concept of negative 
power and the modified coupling matrix, one may establish the duality and polite water-filling for 
Han-Kobayashi scheme. The matrix fli likely remains positive definite. Otherwise, the solution 
to problem (T25T) has infinite power, which suggests there is no feasible power allocation to satisfy 
all the constraints. 

Theorem \5\ says that at the Pareto rate point, it is necessary that and S 1;L have the 
polite water-filling structure. The following theorem states that having the polite water-filling 
structure suffices for to have the polite water-filling structure even at a non-Pareto rate point. 
This enables the optimization of the network part by part. A lemma is needed for the proof and 
reveals more insight to the duality. Although the covariance transformation preserves total power 
such that Ya=i T f (^i) = Yla=i Tr > m general, Tr (£;) = Tr is not true. Surprisingly, 
Tr(Qi) = Tt(q { ) , V/ is true. 

Lemma 3: Let Si : £ be the covariance matrices obtained from by the transformation in 
Definition |U Define the equivalent covariance matrices Q ; = f2^ 2 E;f^ 1//2 and Qz = f2^ 2 S;S7^ 2 . 
The power of the forward and reverse link equivalent covariance matrices of each link is equal, 
i.e., Tr(Qj) = Tr (q { ) , V/. 

The proof is given in Appendix O 

Theorem 6: If one input covariance matrix has the polite water-filling structure while other 
S fc ,Sfc, k 7^ / are fixed, so does its dual Sj, i.e., = fi^SjQ^ 2 is given by water-filling 
over the reverse equivalent channel fij = fij -1 ' 2 Hj { fij 2 as in (l30l) . 

Proof: Because water- filling uniquely achieves the single-user MIMO channel capacity Il671l . 
Q; achieves the capacity of H^. Since the capacities of and H[ are the same under the same 
power constraint ll67l . achieves the capacity of H} with the same power by Lemma [3] and 
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Lemma [21 Therefore, Qi is a water- filling over Hj. ■ 
Decompose a MIMO problem to SISO problems using beams can often reduce the complexity. 
But for different decompositions of the input covariance matrices £i : l, {r^ m } and {qi, m } are 
different in general, and thus the covariance transformation may also be different. However, as 
shown by the theorem below, the covariance transformation of (fT9b is unique and has an explicit 
matrix expression if the input covariance matrices have the polite water-filling structure. 

Theorem 7: For any input covariance matrices Ei : £ satisfying the polite water-filling structure, 
the covariance transformation (fl9l ) is unique, i.e., for all decompositions of E 1:£ ,, it will be 
transformed to the same dual input Ei :i , and vice versa. Furthermore, the dual input covariance 
matrices Ei : £ satisfies the following matrix equation 

n,- 1 !^,!!,,, = EjH^n,- 1 ^,,, l = l,...,L, 
and can be explicitly expressed as 

= vi for 1 - (hmSjhJ, + n,)" 1 ^ ,1 = 1, 

where v\ is the water-filling level in the polite water-filling structure of X; in (|2l]>. 

The proof is given in Appendix |D] The matrix equation in (I3TI) is natural. At the Pareto rate 
point, the covariance transformation will give the same set of rates for the forward and reverse 
links. Hence we have 



(3D 



(32) 



log 
log 



I + HfjEjHyOf 1 



I + n^HfjEjH 



u 



log 
log 



I + HyE^Of 1 

i + ^ujp^hi 



(33) 



Theorem [7] shows that not only the determinant but also the corresponding matrices are equal, 
i.e., n^HfjSjHf,, = EjH^nf 1 !!,,,. 



C. Extension to a Linear Constraint and Colored Noise 

So far we have assumed sum power constraint X^iTr(Ei) < Pt and white noise for 
simplicity. But individual power constraints is more common in a network, which can be easily 
handled by the following observation. All results in this paper can be directly applied in a 



much larger class of problems with a linear constraint Y^i=i ^ r ( E;W; ) < P T and/or colored 



noise with covariance E 



W;, where W; and W; are assumed to be non-singula 



,0 



'A singular constraint or noise covariance matrix may result in infinite power or infinite capacity. 
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and Hermitian. The single linear constraint appears in Lagrangian functions for multiple linear 
constraints, which arise in cases of individual power constraints, per-antenna power constraints, 
interference reduction in cognitive radios, etc. ||56ll , 11571 . Combined with a simple Lagrange 
multiplier update, the algorithms in this paper can be generalized to solve the cases of multiple 
linear constraints. 

For a single linear constraint and colored noise, we denote 

[Hj jfc ] , Yl Tr ( S ^0 ^ Pr > t w <] ) ' (34) 
1=1 J 

as a network where the channel matrices is given by [H^]; the input covariance matrices must 
satisfy the linear constraint J2t=i Tr ^Wij < Pt', and the covariance matrix of the noise at 
the receiver of link / is given by W;. If some links have the same physical receiver and thus the 
same noise, the noise covariance matrices of these links are implicitly assumed to be the same. 
Similarly, if some links have the same physical transmitter, the linear constraint matrices W|'s 
associated with these links are also assumed to be the same. The extension is facilitated by the 
following lemma which can be proved by variable substitutions. 

Lemma 4: The achievable rate region of the network ((34)) is the same as the achievable rate 
region of the network with sum power constraint and white noise 

wr v x fe w fe - l/2 ] , Tr ( s < p T> m ) • (35) 
i=i j 



If S 1:L achieves certain rates and satisfies the sum power constraint in network (1351) . Hi-.l 

•*> j/2 i A 1/2 

obtained by Ti t = W ; 1 5^W ; , V/ achieves the same rates and satisfies the linear constraint 



in network (1341) and vice versa. 

The above implies that the dual of colored noise in the forward link is a linear constraint in 
the reverse link and the dual of the linear constraint in the forward link is colored noise in the 
reverse link as stated in the following theorem. 

Theorem 8: The dual of the network (|34l) is the network 



L 

i=i 



w. 



(36) 



in the sense that 1) both of them have the same achievable rate region; 2) If Xi : £ achieves 
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certain rates and satisfies the linear constraint in network (1341) , its covariance transformatiorO 
achieves better rates, satisfies the dual linear constraint in network (l36l) . and Yla=i T f 

£f =1 Tr(E,Wj) <P T . 

Proof: Apply Lemma |4] to networks (|34l) and (|36l ) to produce a network and its dual with 
sum power constraint and white noise. Then the result follows from Lemma [2l ■ 
Remark 8: For the special case of a BC with a single linear constraint, the duality result here 
reduces to that in ||56l . 

IV. Algorithms 

In this section, we use the weighted sum-rate maximization of B-MAC networks as an example 
to illustrate the benefit of polite water-filling. We first present the simpler case of a sub-class 
of B-MAC networks, the interference tree (iTree) networks, with a concave objective function. 
Then the algorithm is modified to find sub-optimal but good solutions for the general B-MAC 
networks. Readers who are interested in algorithms implementation only may directly go to 
Section ITV-Dl and read Table M for Algorithm PT and Table HH for Algorithm PP. Algorithm P 
and PI are more of theoretic value. 

A. The Optimization Problem 

We consider the following weighted sum-rate maximization problem (WSRMP) with a fixed 
coupling matrix <fr, 

WSRMP: max / (S 1; S L , $) (37) 

s.t. /(Ex, 

L 

^ J>^(£ 1:i ,<&), 

1=1 

h 0, V/, 

L 

£>(Sz) <Pt, 
i=i 

2 The covariance transformation for this case is also calculated from the MMSE receive beams and power allocation that makes 
the SINRs of the forward and reverse links equal, just as in d 1 9b - The only difference is that the identity noise covariance in 
fli is replaced by W; and the all-one vector 1 in l l 1 .St is replaced by the vector t[ m W;t;, m 

L ' ' J m=l,...,M, ,1=1 L 
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where wi > is the weight for link /. 

For the special case of DPC and SIC, we give a partial characterization of the optimal or 
equivalently, the optimal encoding/decoding order. It is in general a difficult problem because the 
encoding and decoding orders at the BC transmitters and the MAC receivers need to be solved 
jointly. However, for each Pseudo BC transmitter! Pseudo MAC receiver defined below in the 
B-MAC network, the optimal encoding/decoding order maximizing g($*) is easily determined 
by the weights and is consistent with the optimal order of an individual BC or MAC, as proved 
in Theorem [9] below. 

Definition 6: In a B-MAC network, a physical transmitter with a set of associated links, whose 
indices forms a set £ B , is said to be a Pseudo BC transmitter if either all links in £ B completely 
interfere with a link k or all links in £ B do not interfere with a link k, VA; G £ B , i.e., the columns 
with indices in £ B of the coupling matrix excluding rows in £ B , are the same. A physical 
receiver with a set of associated links, whose indices forms a set Cm, is said to be a Pseudo 
MAC receiver if either all links in £ M are completely interfered by a link k or all links in C M 
are not interfered by a link k, VA; G C M , i.e., the rows with indices in £ M of the coupling matrix 
excluding columns in £ M , are the same. 

For example, if V/ G £ B , link / is the last one to be decoded at its receiver, then the 
corresponding physical transmitter is a pseudo BC transmitter. If V/ G C M , link I is the first 
one to be encoded at its transmitter, then the corresponding physical receiver is a pseudo MAC 
receiver. 

Theorem 9: In a B-MAC network employing dirty paper coding and successive decoding and 
cancellation, if there exists a pseudo BC transmitter (pseudo MAC receiver), its optimal encoding 
(decoding) order n of the following problem 

max, 0(*OO) (38) 

is that the signal of the link with the n th largest (smallest) weight is the n th one to be encoded 
(decoded). 

Proof: It is proved by isolating the links associated with a Pseudo BC transmitter or a 
Pseudo MAC receiver from the network to form an individual BC or MAC. Let £ B be a set of 
links associated with a pseudo BC transmitter. In the optimal solution of (1381) . : / G £ B } 
and 7r are also the optimal input and encoding order that maximizes the weighted sum-rate of 
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a BC with fixed interference from links in and under multiple linear constraints that the 
interference to links in £g must not exceed the optimal values. The known result on BC with 
multiple linear constraints in [|68l implies that the optimal encoding order is as stated in the 
theorem. For a set of links belonging to a pseudo MAC receiver, using similar method and 
generalizing the result in Section IIII-CI to multiple linear constraints gives the desired decoding 
order. ■ 

All the rest of the paper is for a fixed coupling matrix $ and the argument <fr in / (Si, S^, 
is omitted. In the simulation, DPC and SIC are employed and the encoding/decoding order is 
chosen to satisfy Theorem |9j We consider centralized algorithms with global channel knowledge. 
Distributed algorithms with partial knowledge can be developed based on them. More is discussed 
in Section |VU 

B. iTree Networks 

iTree networks appears to be a natural extension of MAC and BC. We define it below. 

Definition 7: A B-MAC network with a fixed coupling matrix is called an Interference Tree 
(iTree) Network if after interference cancellation, the links can be indexed such that any link is 
not interfered by the links with smaller indices. 

Definition 8: In an Interference Graph, each node represents a link. A directional edge from 
node i to node j means that link i causes interference to link j. 

Remark 9: The iTree network is related to but different from the network with tree topology 
in terms of nonzero channel gain. A network with tree topology implies iTree network only if the 
interference cancellation order is chosen properly. For example, a MAC which has tree topology 
is not an iTree network if the successive decoding is not employed at the receiver. Similarly, a 
BC is not an iTree network if dirty paper coding is not employed. On the other hand, even if 
there are loops in a network, it may be an iTree network if the interference cancellation order 
is right. We give such an example in Fig. [2] where there are three physical transmitters and 
three physical receivers with four transmission links 1, 2, 3, and 4, and dirty paper coding and 
successive decoding and cancellation are employed. Let xj be the transmit signal of link I. For 
encoding/decoding order A, at the physical receiver corresponding to R\ and R 2 , the signal x 2 
is decoded after Xi. At the physical transmitter corresponding to T 2 and T 3 , the signal x 3 is 
encoded after x 2 . With this encoding and decoding order, each link I G {2, 3, 4} is not interfered 
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Transmitters Receivers 




Encoding/Decoding Order A Encoding/Decoding Order B 

Figure 2. A network with topology loop in terms of nonzero channel gain and encoding/decoding order A is an iTree network, 
whose interference graph does not have any directional loop. With encoding/decoding order B, it is not an iTree network because 
the interference graph has directional loops. 



by the first / — 1 links. Therefore, the network in Fig. [2] is still an iTree network even though 
it has a loop of nonzero channel gains. However, for encoding/decoding order B, SIC is not 
employed at Ri/R 2 , and x 2 is encoded after x 3 at T 2 /T 3 . The network in Fig. [2] is no longer an 
iTree network because the interference graph has directional loops, making the iTree indexing 
impossible. 

iTree networks can be equivalently defined using their interference graphs. 

Lemma 5: A B-MAC network with a fixed coupling matrix is an iTree network if and only 
if after interference cancellation, its interference graph does not have any directional loop. 

Proof: If the interference graph has no loops, the following algorithm can find indices 
satisfying the definition of the iTree network: 1) I — 1; 2) index a node whose edges are all 
incoming by /; 3) delete the node and all the edges connected to it; 4) let / = / + 1 and repeat 
2) until all nodes are indexed. On the other hand, the interference graph of an iTree network 
only has edges from a node to the nodes with lower indices, making them impossible to form 
a directional loop. ■ 

Since the coupling matrix of the reverse links is the transpose of the coupling matrix of the 
forward links, the interference relation is reversed as stated in the following lemmas. Without 
loss of generality, we consider iTree networks where the I th link is not interfered by the first I — 1 
links in this paper, i.e., the coupling matrix is given by :k = 0,\fk < I and 3?^ = l,\fk > I. 

Lemma 6: If in an iTree network, the I th link is not interfered by the links with lower indices, 
in the reverse links, the I th link is not interfered by the links with higher indices. 

The following is obvious. 
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Lemma 7: The interference graph of the reverse links can be obtained from the interference 
graph of the forward links by reversing the directions of all the edges. 

Two examples of iTree networks with concave weighted sum-rate functions are as follows. 
An obvious example is the MAC with the decoding order equal to the ascending order of the 
weights {wi}. The second example is the two-user Z channel, where the cross link channel gain 
satisfies H 2 ,i = 0, providing that some further conditions are satisfied as stated in the following 
theorem. 

Theorem 10: The weighted sum-rate of a two-user Z channel is concave function of the input 
covariance matrices if the following are satisfied. 

1) The channel matrices H\ 2 and H 2 2 are invertible and satisfy Hi, 2 H 2j2 b H{ 2 H 12 . 

2) The weights satisfy wi < w 2 . 

The proof is given in Appendix [0 The important question of a complete characterization of 
the subset of iTree networks with concave weighted sum-rate function, i.e., finding the necessary 
and sufficient conditions on H fc j and <fr for the subset, is out of the scope of this paper and is 
left for future work. 

C. Algorithms for iTree Networks with Concave Objective Functions 

The algorithm in this section illustrates the usage of polite water-filling and is a nontrivial 
generalization of the algorithm in [9J . First we show that WSRMP (1371) of an iTree network with 
concave objective function /(•) can be equivalently solved by the following convex optimization 
problem. 

max/ mod (E(l),E(2),..,E(L)) (39) 

S(1:L) 

L 

s.t.Vfc, /, E; (k) h 0, and VA; Tr ( S * ( k )) ^ p t, 

i=i 

where S (k) = (Si (k) , S L (k)) for k = 1,...,L with S; (k) e C Lt i xLt l The objective 
function is defined as 

/ mod (S(l),S(2),...,S(L))^ 
1 L 

2 J2f (Si ([1 - i] L ) , E, ([/ - i] L ) , S L ([L - z] L ) ) , 

i=l 
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where [n] L = (n mod L) + 1. 

The physical meaning of the optimization problem (|39| ) is that it is a weighted sum-rate 
maximization problem of L networks, all of which are identical to the network in the original 
problem in 071) . The purpose of expanding the single-network optimization problem in ( ITTT ) 
to the multiple-network optimization problem in (|39| ) is to decouple the power constraints so 
that the input covariance matrices of a network belong to different power constraints and the 
input covariance matrices of a power constraint belong to different networks. In problem (l39l) . 
the update of the input covariance matrices of a power constraint is easier because there is 
no interference among the networks, and other input covariance matrices not belonging to this 
power constraint can be treated as constants, resulting in monotonically convergent iterative 
algorithm. A similar method is also used in [|9]|, ETTl . We use the subscript i to denote the 
terms corresponding to the i th network, e.g., the input covariance matrices for the i th network 
is denoted as Eo^ = (£i,i, Ej jL ). The one-to-one mapping between (£ij ;L , £l,i : l) and 
(E(1),...,£(L)) is given by 

£ M = E,([Z-i]J, (40) 
E,(fc) = E p _ fc]z(){j Vi,Me{l,...,L}. 

Then we have 

1 L L 

/ mod (£ (1) , E (2) , E (L)) = - E WlZ ^ ( S *.i^> *) • 

8=1 {=1 

Note that the power constraint Xl^iTr(S i (k)) < Pp, Wk couples the networks together. The 
following example illustrates the mapping in (|40"1) for L = 4: 

1 th link 2 th link 3 th link 4 th link 

1 th network E x (1) E 2 (2) E 3 (3) E 4 (4) 

2 th network E x (4) E 2 (1) E 3 (2) E 4 (3) , 

3 th network Ei (3) E 2 (4) E 3 (1) E 4 (2) 

4 th network E x (2) E 2 (3) E 3 (4) E 4 (1) 

where the element at the / th column and i th row is equal to E^, the input covariance matrix for 
the I th link of the i th network. 

The following lemma holds for / mo d(-)- 
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Lemma 8: The function / mod (-) satisfies the following properties: 

1) Let E, y 0, V/, satisfy Ef=i Tr ( s «) < p t- The mapping E (k) = Ei :L , Vfc, results in 
£f =1 Tr(E z (fc)) < P T , VA, and / mod (E(l : L)) satisfying / mod (E 1:L , E 1:L , E 1:L ) = 

2) Let £,(Jfc) b 0, VZ,fc, satisfy ^ Tr (Si (fe)) < P T , \/k. The mapping E 1:L = (1/L) 
£)jj =1 E (A) results in J]f = i Tr (E ; ) < P T and / (E 1;L ) satisfying 

/ mod (E(l),E(2),...,E(L)) 
< /(Si*)- 

Proof: The first property is obvious and the second property holds because 

/ mod (E(l),E(2),...,E(L)) 

^ /(|Esi(*).-^E E ^(*)) (41) 

V fe=i fc=i / 

where (|4T1) follows from the concavity of /(•). ■ 
The lemma says that the problems (|37T) and (|39| ) are equivalent and every optimal solu- 
tion E (1) , E (2) , E (L) of problem (|39l ) maps directly to the optimal solution E 1:L = 

(1/L) ELi S of the WSRMP (H7J. 

We first obtain insight of the problem by finding the necessary and sufficient conditions 
satisfied by the optimum. With the insight, we design an algorithm which monotonically increases 
the objective function until convergence to the optimum. 

Theorem 11: Necessity: If E (1) , E (L) is an optimum of problem (|39l ), then VA;, E (A;) 
must satisfy the following optimality conditions: 

1) For any 1 < I < L, E z [k] possesses the polite water-filling structure as in Defini- 
tion (5J i.e., firj^fci i^i (^) ^[i~k] i * s S^ ven by water-filling over the equivalent channel 
il [i-k] L ,i"-hi ll [i-k] L ,i- 

2) The water- filling level for E^ (k) is given by = wi/fl, where Pi does not depend on k 
and pt, > is chosen such that Ylf=i ^ r {^i (^)J = Pt- 

Sufficiency: If certain E (1) , E (L) satisfies the above optimality conditions, then it must 
satisfy the KKT conditions of problem (|39l ), and thus is the global optimum of problem (|39l) . 

We go over the proof in detail as it helps design the algorithm. It can be proved by contradiction 
that the optimum E (1) , E (L) must satisfy Yld=i T f y^i (^) 1 = Pt, Vfc. Otherwise, if 
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Yla=i T f {^i (k)J < Pt for some k, we can strictly improve the rate of the first link in the 
([1 — k] L ) th network by increasing the power of S x (k) until ^2f =1 Tr (j^i (k)j = Pt- All other 
rates are not affected because the first link causes no interference to other links. Then the objective 
function can be strictly increasec^l, which contradicts that E (1) , £ (L) is an optimum. 

The remaining necessity part is proved by showing that if for some k, £ (k) does not satisfy 
the polite water-filling structure in Theorem HH the objective function can be strictly increased 
by enforcing this structure on S (k). Without loss of generality, we only need to prove this for 
k — 1 due to the circular structure of / m od(-)> 

/ mod (£(l),£ (2) ,..,£(£)) 

= / mod (£ (£),.. .,E(L),£(1),...,E (A; -1)). 

We define some notations and give two useful lemmas. For the i th network, fixing the input 
covariance matrices Sjj, j = i + 1,...,L for the last L — i links, the first i links form a 
sub-network 

^.*] w=w ' E Tr ( S ^) = P T' [Wd^w ) , ( 42 ) 



1=1 



where Wjj = I + X^=i+i ^jHjjSjjH^ is the covariance matrix of the equivalent colored 
noise of link I; P T = Y^)=i Tr ^ * s obvious that this sub-network is still an iTree network. 

By Theorem [8} the corresponding dual sub-network is 

Hl,J .>E Tr (^ W m) = , J • (43) 

' i=i J 

Denote X^ (Sj <&) and X, ^ 3> T j the forward and reverse link rates of the I th link of 

the i th sub-network (1421) achieved by Sj 1; j and Sj^.j respectively. In contrast, X ijZ $) and 

Xj ( j ^Si i ; L, <E» T j denote the forward and reverse link rates of the I th link of the i th network. 

Lemma 9: Let tl i>1:L = fs^i, S^lJ be the covariance transformation of S^i-x, = 

(Sj 1; Sj 2 , Sj L ), applied to the network. Then S^i^ = ^S^i, Sj j2 , ^ is also the 
covariance transformation of £i,i : i = £ ii2 , £«,«)> applied to the i th sub-network d42l) . 

Proof: The interference from link z + 1,...,L is counted in the colored noise in the i th 
sub-network (1421) . Therefore, the MMSE-SIC receive vectors {r i: i >m , I — 1, i} are the same in 

3 If Hi,! = 0, we can still increase the objective function by improving the rate of the second link in the ([2 — fc] L ) th network, 
and so on. 
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both i network and its sub-network. Because the i network is an iTree network, in its dual 
network, there is no interference from link i + 1, L to link 1, i by Lemma [6l Therefore, for 
link 1, i, there is no difference between the i th dual sub-network (|43l) and the i th dual network. 
In both dual networks, the transmit powers of the first i links given by (fl"8l) in the covariance 
transformation achieve the same SINRs as in the i th network. But for fixed {r it i >m , I = 1, 
and {U^mJ = 1, the transmit powers producing the same SINRs are unique [50, Lemma 

1]. Therefore the transmit powers of the first i links must be the same for the two dual networks, 
which implies that Lemma [9] must hold. ■ 
Lemma 10: The necessary and sufficient conditions for X3i : £ to be the optimal solution of 
problem (1371) for an iTree network with concave objective function are listed below. 

1) It possesses the polite water-filling structure as in Definition [51 

2) The water-filling level v\ for £j is given by v\ = Wi/p,, where fi > is chosen such that 



The proof is given in Appendix [0 

Then we give an algorithm to improve the objective function if £ (1) does not satisfy the 
two necessary conditions in Theorem [TT] It contains three steps. Note that S (1) contains 
the input covariance matrices of the i th link of the i th network for % — 1, ...,L, i.e., S (1) = 



Step 1: For i = l,...,L, calculate X^i^-i by the covariance transformation applied to the i th 
sub-network. Due to the special interference structure of the iTree network, the calculation of the 
reverse transmit powers of the covariance transformation can be simplified to be calculated one by 
one as follows. When calculating q^ m , the transmit powers {qi^, m '■ m — 1, M i:k , k = 1, 
/ — 1} and {qi t i )n : n = 1, ...,m — 1} have been calculated. Therefore, we can calculate = 
Y^m=i Qi,k,m r i,k,m r i jfc m , k = 1, ...,/ — 1 and obtain the interference-plus-noise covariance matrix 
of the reverse link I as Q it i = I + J2k~=i *Jfe,i H *,i^i I fc H fc,i- Then %i,m is g iven b Y 



Ef =1 Tr (E,) = P T . 



(44) 




2 
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and correspondingly 

Mi-, 



m=l 



By Theorem H we have Z M (e^, $ T J > 2^ (S ijl:< , *) = 2y (E^, *) , / = and 
E; =1 Tr (Ey W w ) = Eli Tr (E M ) = J* . 

Step 2: Improve Ef=i by enforcing the polite water-filling structure on 

Ej i,Vz. By Lemma [6[ in the i th sub network, the reverse link i causes no interference to the 
first i — 1 reverse links. Fixing Ey, I = 1, i — 1, Vi, we can improve Ei=i (Ej jl: j, 
without reducing the rates of reverse link 1, i — 1 in the i* h sub-network for all i by solving the 
following weighted sum-rate maximization problem for L parallel channels with colored noise: 



L 

max VVlog I + H} i S M H iji nri (46) 

£ M ^0,Vi=l,...,L 
L L / i—1 

s.t. ^ Tr (s M W M ) < X) P t - X) Tr W ^ 

i=l i=l \ 1=1 

where n M = I + Eti $M H f,A* H M and W M = 1 + E;= m ^H^-E^H^ = This 
is a classic problem with unique water-filling solution. The solution with sum power constraint 
is obtained in J9]|, which can be extended to the case of linear constraint here. Alternatively, 
the solution can be obtained from Theorem \5\ and Lemma |4] for a network of parallel channels. 
Perform the thin SVD fl~J /2 B. i:i n~J /2 = F^G}. Let N, t = Rank (H M ) and 6 itj be the j th 
diagonal element of Af. Obtain as 

Dj = diag(d i> i,...,d i| jv i ), 

d <-> = (j-j-) + - 3 = 1 - n " <47) 

where fi is chosen such that Ef=i Y^f=i dij — J2i=i [Pt ~ E!=i ^ r \^hi^hi)\ The calcula- 
tion of fx is easy: 1) Let index the j th eigen-channel of the i th network's I th link. Initialize 
the set of indices of channels with nonnegative power as T = {(1, 1) , (1, N±) , ... (L, 1) , 
(L,N L )} to include all eigen-channels; 2) Calculate ll for the channels in T without (•)+ 
operation; 3) For all (n, m) E T, if d n m < 0, fix it with zero power d n m = 0, delete (n, m) 
from T. Repeat step 2) until all d nm > 0. Then the optimal solution of problem (|46l ) is given 
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by 

% = n^F^Ffn^ 2 , i = 1, l, (48) 

i.e., = 1, satisfy the polite water-filling structure and the water-filling levels are 

proportional to the weights {u^}. By Theorem [6l if S (1) = (X^, does not satisfy 
the polite water-filling structure or the water-filling levels are not proportional to the weights, 
nor does which means that it is not the optimal solution of problem (l46l) . 

Therefore if we let S^ 1: - = (i^, £ M _i, £• ^ , Vz, we must have £]f =1 iu^m > 

Step 3: Improve the objective function by the covariance transformation from £• Ui to S- 1:i = 
S- j) for all the L sub-networks. Again, the calculation of the transmit powers of the 
covariance transformation can be simplified similar to (1441) . Let £• hL = (£• 15 £• i; S iji+1 , 
S i)L ),Vi. By Theorem® we have *) > X M (X $ T ) , Z = 1, i and Ef =1 ELi 

Tr(s;j = Eti (E;=iTr(s i ,iW i ,) + Tr (j^Wy) ) = Eti p h which implies that the 
sum power of (S x 1 . L , Y! L 1L ) is still LP T . Note that the first i links cause no interference 
to other links and X^ (E s 1L , $) = X^ (£• $) , / = 1, i. Combining the above facts and 
the second property in Lemma [H the objective function can be strictly increased by updating 
£ (fc)'s as 

1 L 

= -^S'(Z),l<fc<L, (49) 

where (£' (1) , £' (L)) is obtained from 1:L ) according to (1401) ; and the updated 

£ (fc)'s satisfy Ef=i Tr ( s « (*)) = ^T,Vfc. 

This contradiction proves that the optimal £; (k) , / must satisfy the polite water-filling 
structure and the water- filling level for £; (k) is given by wi/fik- The condition that jl k = /}, Vfc 
can also be proved by contradiction. If jl^s are different, the water-filling levels wi/jlis of the 
input covariance matrices S x 1:L for the first network are not proportional to the weights. Then 
by Lemma [TOl S x 1:L is not the optimal solution of problem (|37l ) with sum power constraint 
E^i^r f° r me fi fSt networ k, which implies that there exists S' x VL achieving a higher 

weighted sum-rate for the first network with the same sum power. Similarly, for the i th network, 
Vz, there exists £ ■ UL achieving a higher weighted sum-rate with the same sum power. Then 
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after updating £ (fc)'s from £• 1 . L 's using ( |49l ), the objective function of problem (|39l ) is strictly 
increased. This contradiction completes the proof for the necessity part. 

Now, we prove the sufficiency part. For convenience, we map £ (1) , £ (L) to fSi^x, 

using (|40l) and show that 1b L satisfies the KKT conditions of problem 

( |39| ). The Lagrangian of problem (l39l) is 



L (/il:L? ©1:L,1:L) £l:L,l:Z,) 



L L 



i=l 1=1 

L / L 

+Z>* p ^-E Tr ( s w) 

fc=i V i=i 

+ ^^Tr(S M M ). 



2=1 Z = l 



The KKT conditions are 



V 33ii ,L = 0; ^Tr(Sp_ fc]i , { ) = P T ; 

i=i 

Tr (S M e M ) = 0; /x fc > 0, £<,,, M b 0; (50) 
for all i,k,l = 1, L. The condition Vs^ t L = can be expressed as 



——<f> k M 



x (n-* - (n hk + H Jfc>fc E iifc Hl )fc ) J H fc , z + I 

■Hf,('n i) , + H, ) ,S ii ,H 1 t I , )~ 1 H {i{ +^— M . (51) 



Since £j.i:L satisfies the polite water-filling structure with water-filling levels given by (w;//t)'s, 
by Theorem H the dual input covariance matrices £o : l can be expressed as 

% >k = ^ UlTl - (h i>k + H k>k t i>k B.Q J , Wk. (52) 
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Then we have 

k+i **• 

x - (fl i>k + H M E i)fc Hi )fc )~ 1 ^ + I 

Choose the dual variables as fik = p>, V/c and substitute into condition (|5TT) to obtain 

fii,, = ^H*, (n i( , + H M S M Hj^ ~* H M + 4e^. (53) 
Because Equation (l53l) is also the KKT condition of the single-user polite water-filling problem 

~ —1/2 * 1/2 r- 

over the channel Q i 1 1 H/ ) jf2 i z and X^ has polite water-filling structure over this channel with 
water-filling level it?j/// by the optimality condition, satisfies condition (|53l) . 

It can be verified that ySi t i.L, E^i^J also satisfies all other KKT conditions in (l50l) . Since 
problem (|39l ) is convex, KKT conditions are sufficient for global optimality. This completes the 
proof for Theorem [TT] 

Algorithm P: The proof of Theorem ITTI actually gives an algorithm to find the optimal solution 
of problem (l39l) by monotonically increasing the objective function. We refer to it as Algorithm 
P and summarize it in Table H where P stands for Polite. Note that although in each iteration, we 
only enforce the polite water- filling structure in Theorem ITTI on X (1), all S (fc)'s will eventually 
satisfy the polite water-filling structure due to (|49|) . 

The convergence and optimality of Algorithm P is proved in the theorem below. 

Theorem 12: For any iTree network with concave weighted sum-rate function /(•), Algorithm 
P converges to the optimal solution of problem (|39| ) and problem (1371) . 

Proof: In each iteration, Algorithm P monotonically increases the objective function in 
problem (l39l) . Since the objective is upper bounded, Algorithm P must converge to a fixed point. 
At the fixed point, X (1) must satisfy the optimality conditions in Theorem ITTI Otherwise, the 
algorithm can strictly increase the objective function, which contradicts with the assumption of 
fixed point. Then at the fixed point, all S (fc)'s satisfy the optimality conditions in Theorem ITTI 
because at the end of each iteration, we make all S (fc)'s the same in (l49i By Theorem QTJ 
the fixed point is the optimal solution of problem (|39| ). And by Lemma [H it must also be the 
optimal solution of problem (1371) . ■ 



May 4, 2010 



DRAFT 



SUBMITTED TO IEEE TRANSACTIONS ON INFORMATION THEORY, APR. 2010 



37 



Table I 

Algorithm P (for iTree Networks with Concave /(■)) 



Choose a feasible initial point S (k) = (£1, El) , k = 1, L 
such that E; t 0,VZ and Ya=i Tr ( s *) < p T- 
While not converge do 

1. Calculate ^l.i-.l-i by the covariance transformation applied 
to the L th sub-network. Because all E (fc)'s are the same, 
the Ei.i : i_i's for other sub-networks can be obtained by 

E^itj-i = (fjL,U — J ^L,i-l) ! i — 1) — > L — 1 

2. Solve for the optimal E i /s in the optimization problem (l46T i 
by polite water-filling. 

3. For Vi, calculate E^ 1:i by the covariance transformation of 
E.j ui = ^S; i, £; ,_!, E^ applied to the i th sub-network. 
Obtain ^E (1),...,£ (L)j from fs^ l-L , £^ according 
to ®, where = (s^ E^.E,.,., E, ,.) ,Vi. 

4. Update E (fc)'s using d49| i. 
End 



Algorithm Pl: Another algorithm named PI can be designed with improved convergence 
speed using a similar method as in ||2TT| by modifying the updating step in ( |49l ) at the cost of a 
simple linear search. The modified update rule is 



op (l-/3)P T L 



^ Ei=2 P l 1=1 

where P k = Yld=i T r i^'i (^)) >VA;; and /3 is the optimal solution of the following optimization 
problem 



max 



mPi/(LP T ),l] \ Pl Et=2 P l 1=2 

It is clear that the update (l54~l) yields at least the same improvement of the objective function as 
the update (|49l) because if we let ft = P x j (LP T ), d54j) reduces to <@9]). Algorithm Pl is provably 
convergent and is observed to have much faster convergence speed in the simulations. 
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D. Algorithms for General B-MAC Networks 

Algorithm PT: We obtain Algorithm PT, standing for Polite-Transformation, for general B- 
MAC networks by a modification of Algorithm PI so that each iteration is simply a polite water- 
filling in the forward links and a covariance transformation in the reverse links. It is observed that 
(3 in (l54l) is always close to 1 in simulation, which suggests a more aggressive iterative algorithm 
by modifying Algorithm PI as follows: Change the linear constraint of problem (|46|) in step 2 to 
sum power constraint: J2i=i Tr — Pt\ In step 1, simply let = / = 1, i — 1, i = 

1, ...,L, where S^,/ = is the solution to the modified problem of (l46l) under sum 

power constraint in the step 2 of the previous iteration; In step 2, solve the optimal Sj/s for 
the modified problem of (1461) . where in (1471) . /i is chosen to satisfy the sum power constraint 
Yla=i T f \^'hi) = -^ >r ' m ste P 3> treating ^S' x 1; ,2> •••5 as me m P ut covariance matrices 

of a single network, obtain S' (1) from the covariance transformation of 1, •••> 
and let /3 = 1 in the update (1541) ; Because of the symmetry, we can change the roles of the 
forward and reverse links so that the water- filling is for the forward links. Algorithm PT can be 
reorganized in a simpler form as in Table [II] in terms of one network without expanding to L 
networks. 

Note that Algorithm PT does not require any special interference structure or concavity of 
the weighted sum-rate function. Therefore, it works for general MEMO B-MAC networks. After 
convergence, the solution of Algorithm PT satisfies the KKT conditions of WSRMP, and thus 
is a stationary point. 

Theorem 13: Apply Algorithm PT to solve problem (1371) . whose Lagrange function is 



L 



L (fJL, &1.L 



1=1 
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Table II 

Algorithm PT (for B-MAC Networks) 



Initialize Ei : l and Jjul with zeros or other values such that y 0,V£. 

While not converge do 

1. Polite water-filling in the forward links 

a. For V/, obtain fi; and fli from and Xi : £ using (f2]i and © 
respectively. Perform thin SVD f2~ 1/2 H M n~ 1/2 = F/A;Gj\ 

b. Let Si^ be the i th diagonal element of A^ and let pi^ be the 
norm square of the i th column of Cl l 1 ^ 2 G/. Obtain D; as 

D; = diag(di A , ...,d LNl ), 

dl,i=(f-Tt) + ' i = 1 >-> Nl > 
where /i is chosen such that Ylt=i Ym^i Pl,%di,i = Pt- 

c. Update S;'s as 

s ; = n; 1/2 G l -D l G]ci; 1/2 ,vi. 

2. Covariance transformation in the reverse links 

Obtain from the covariance transformation of using ( fl9] l. 

End 



and the KKT conditions are 

= 0; 

L 

^Tr(£,) = P r ; 

i=i 

Tr(Sie,) = 0; 

/u>0; £,,e,>:0; (55) 

for all / = 1, L. If Algorithm PT converges, the solution of the algorithm satisfies the above 
KKT conditions, and thus is a stationary point. 

Proof: Let Si : £ and be the solution of Algorithm PT after convergence. It is clear that 
Si : £ satisfies the polite water- filling structure and the water-filling levels z/j's are proportional 
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to the weights, i.e., i>\ = Wi/fi. Furthermore, we have S; = i>\ {^l^ 1 — ^H^SjHj^ + f^j J 
by Theorem [7J The rest of the proof is similar to the proof of the sufficiency part in Theorem 
QTJ Choose the dual variable as fi = p, and substitute Si : £ into condition Vs ; L = to obtain 

ft, = ^H^(a + H M S^) _1 H M + iei, (56) 

which holds because of the polite water-filling structure of It can be verified that S 1;L 

satisfies all other KKT conditions in (1551 ) as well. ■ 
If the weighted sum-rate function in WSRMP is concave, the problem is convex and any 
stationary point is also a global maximum. In other cases when the weighted sum-rate function 
is not concave, we cannot guarantee that Algorithm PT converges to the global optimum since 
it may get stuck at some other stationary point. Nonetheless, Algorithm PT gives very good 
solution. 

We prove for two simple examples that Algorithm PT converges monotonically to a stationary 
point. More discussion on the convergence is given in the Remark [TOl 

Example 1: Consider weighted sum-rate maximization for a 2-user SISO MAC with w 2 > W\, 
where the user 1 is decoded first. The optimization problem is 



, A , / 1 + PlQl + V292 I 

max/ = toilog — + w 2 log (1 + p 2 92) 



P1.P2 \ 1 + P292 

s.t.pi +p 2 < P T , (57) 

where pi is the transmit power and gi is the channel gain. Algorithm PT updates the transmit 
powers as 

1 + P 292 " 

9i 

1 + P292 



Pi = VWi 



P2 = vw 2 



1 + Pig* + P2#2 92 , 

and v is chosen such that p\ + p' = P T . Suppose Then l±Pim±vm > kkpm+msi, if 

„ _ i+pi9i±P292^ men _ p i an( j ^ < p 2 Therefore we should increase the water- filling level. 



wi9i 



Similarly if v = 1+Pl ^lg^ 292 , then p 1 > p\ and p 2 — p 2 . Therefore the water-filling level must 



be decreased. Then we have l+sm+vm < v < l+vm+vm _ Because of u > 1+Pl3l+P2g2 , p x is 

wigi wigi w\g\ rL 

increased and thus the transmit powers are updated according to the gradient direction. But if 
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we increase pi too much, the direction of the gradient may change, then the objective may still 
be decreased. However it follows from v < 1+Plg2+P2g2 that M- > jr- after the update. Hence 

W292 opi ap2 r 

the objective is strictly increased after each update until J^j- = J^-, which is the stationary point 
of problem (1571) . 

Similarly, one can prove the result for sum-rate maximization of a 2-user SISO Z channel. 

Algorithm PP: We also designed Algorithm PP, standing for Polite-Polite, that uses polite 
water- filling for both the forward and reverse links. It is modified from Algorithm PT, where 
each iteration is a polite water-filling in the forward link and a covariance transformation in the 
reverse link. The algorithm is shown in Table [TITJ It can be proved that Theorem [13] also holds 
for Algorithm PP following similar proof as that for Algorithm PT. 

It is observed in the simulations that both Algorithm PT and Algorithm PP have faster 
convergence speed and higher accuracy than the algorithms not exploiting the optimal input 
structure. In most cases, Algorithm PT and PP are observed to have similar convergence speed. 
But for networks with strong interference loops such as strong interference channel, Algorithm 
PP is observed to have a better convergence behavior than Algorithm PT. Another advantage of 
Algorithm PP is that the polite water-filling procedure to obtain has lower complexity than 
the covariance transformation in Algorithm PT, as will be discussed in Section HV-El 

Remark 10: A nontrivial future work is to find the convergence conditions of Algorithm PT 
or PP for iTree networks and general B-MAC networks. It has been observed that Algorithm PT 
and PP always converge monotonically in iTree networks with concave or non-concave objective 
functions, just like the provably monotonically convergent algorithms designed for iTree networks 
in ll40l . It is possible that the structure of the iTree networks guarantees it, which suggests that 
iTree networks has more useful properties to be found. For general B-MAC networks, one 
possible solution may be extending the approach used in [|33l for iterative selfish water-filling 
to polite water- filling. It is proved in 11331 that under certain stringent conditions, the selfish 
water-filling is a contraction mapping, and thus the uniqueness of the Nash equilibrium and the 
convergence of the iterative selfish water-filling is guaranteed by the fixed point theory. The 
physical interpretation of these conditions is that the interference among the links is sufficiently 
small. Because in the polite water-filling, the interference to other links is well taken care of by 
the pre-whitening of the channel using Cl\ T 1 ^ 2 , we conjecture that the convergence conditions for 
Algorithm PT or PP will be much looser than iterative selfish water- filling. This conjecture is 
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Table III 



Algorithm PP (for B-MAC Networks) 




Initialize and Hi's such that Yld=i ^ r (^') — 

% t 0,VZ and fl t = I, VI 
While not converge do 
1. Polite water-filling in the forward links 

a. For VI, obtain tti from 'Sul using ([6j. 
Perform thin SVD fi~ 1/2 H M f2~ 1/2 = FjAjGj. 

b. Let (5; ^ be the i*' 1 diagonal element of and let p;^ be the 
norm square of the i th column of tl { 1 G/. Obtain D; as 

D; = diag(dis, ...,di iNl ), 



c. Update S;'s as 

S ; = r2 i " 1/2 G i D i G|^ 1/2 ,V/. 
2. Polite water-filling in the reverse links 

a. For V/, obtain $7; from Si : l using (f2]i. 
Perform thin SVD 0^ 1/2 H;,;0^ 1/2 = F;A;Gj. 

b. Let 5;,, be the i*' 1 diagonal element of A^ and let pi^ be the 
norm square of the i th column of 1 ^ 2 Fi. Obtain D; as 

D ; =diag (dis, ...,di iNl ), 




where /i is chosen such that 



Ef=i Hili Pl,idi,i = Pt- 




where /i is chosen such that 



c 



Update Sfs as 

% = rj- 1/2 F i D i F^- 1/2 ,v/. 



End 



verified by the simulations in Section |Vj where Algorithm PP is observed to converge for almost 
all general B-MAC networks simulated. Algorithm PT also converges for most general B-MAC 
networks simulated except for the interference channel with very strong interference. But even 
for this case, Algorithm PT still converges after a few trials, e.g., three trials, to select a good 
initial point. Another approach to convergence proof may be interpreting the polite water-filling 
as message passing in a factor graph and derive the conditions of convergence on the graph. 



Remark 11: The following is the connection to another iterative water- filling algorithm pro- 
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posed in (351 Section IV] for parallel SISO interference networks. It is derived by directly solving 
the KKT conditions of the weighted-sum rate maximization problem. As expected, the result 
also has a term t k {n) related to the interference to other nodes. In the SISO case, the matrix Cti 
equals to 1 + in 11331 only for stationary points. Algorithm PT or PP can be used there to 
reduce the inner and outer iterations to one iteration, reducing the complexity. On the other hand, 
we can design an algorithm by directly solving the KKT conditions for the MIMO case and by 
using the concept of polite water- filling. It results in an algorithm that has similar performance 
to Algorithm PT but does not take the advantage of the duality. In the algorithm, we replace the 
matrix f2; by the term 



in (l55l) . avoiding the covariance transformation in Algorithm PT and reverse link polite water- 
filling in Algorithm PP. But as a result, the equivalent channel becomes a function of ji and in 
order to perform the polite water-filling, SVD has to be repeatedly done while searching for fi 
to satisfy the power constraint, increasing the overall complexity. 

E. Complexity Analysis 

We first give a brief analysis to show the complexity order per iteration of the proposed 
algorithms. The main computation complexity lies in the SVD and matrix inverse operations 
in the polite water-filling procedure and in the calculation of the MMSE-SIC receivers for 
the covariance transformation. Note that both SVD and matrix inverse are performed over the 
matrices whose dimensions are equal to the number of the transmit and receive antennas at each 
node and are not increased with the number of transmission links L. This, however, is not true for 
generic optimization methods which do not take advantage of the polite water-filling structure of 
the optimal solution. Since the operations of SVD and matrix inverse have similar complexity, 
we use the order of the total number of SVD and matrix inverse operations to measure the 
complexity. Note that we need to calculate Mi MMSE-SIC receive vectors for the I th link in the 
covariance transformation. However, as pointed out in [1551 . if the precoding matrix T/ is chosen 
to decorrelate the streams of link /, we only need to perform one matrix inverse and one SVD 
to find such precoding matrices and the corresponding MMSE-SIC receive vectors. 
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• We give the complexity order of each iteration of Algorithm P. In step 1 , we need to calculate 
E ii i :i _ 1 by the covariance transformation, which has a complexity order of O (L). In step 
2, the complexity order of the polite water-filling procedure to obtain S-/s is also O (L). 
And in step 3, the covariance transformation to calculate S 4 % — 1, L has a complexity 
order of O (L 2 ). Therefore, the total complexity order is O (L 2 ). 

• Algorithm PI only adds a linear search compared to Algorithm P. Therefore, it has the same 
complexity order as Algorithm P. 

• The complexity order of Algorithm PT depends on whether the network under optimization 
is iTree network or not. For iTree network, it is clear that the covariance transformation to 
obtain in step 1 and the polite water-filling to obtain IV s in step 2 have a complexity 
order of O (L) because the reverse link power in the covariance transformation can be 
calculated one by one. For other networks, to calculate the reverse transmit powers in 
the covariance transformation, we need to solve a Yli=i M r dimensional linear equation as 
in (fl"8l) . whose complexity depends on the density and structure of the cross-talk matrix 
^ (T,R). In the worst case, the complexity order is O (L 3 ). In other cases such as with 
triangular or sparse ^ (T,R), the complexity is much lower. Fortunately, there are many 
fast algorithms to solve the linear equations even for relatively large dimensions, and in 
practice, ^ (T, R) is usually sparse for a large wireless network because of path loss or 
interference suppression techniques. 

• The complexity order of Algorithm PP is O (L) regardless of whether the network is an 
iTree network or not since the reverse input covariance matrices are also obtained by polite 
water-filling. 

In practice, Algorithm PT is the choice for iTree networks because its performance is similar 
to Algorithm PI but the complexity is much lower, while Algorithm PP, which has even lower 
complexity, is a better choice for general B-MAC networks because it also has fast convergence 
speed and the convergence behavior is less sensitive to the interference loops and interference 
strength. Therefore, Algorithm P and PI are more of theoretic value. Although Algorithm P and 
PI have higher complexity than the O (L) algorithms in flU, EH, EQl, (2T]|, they are still much 
simpler than the standard interior point methods of complexity order of O (L 3 ) ll66l and can be 
used in more general cases compared to those for MIMO MAC/BC. 
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Second, we discuss the number of iterations needed or in other words, the convergence speed. 
The number of neighbors of a node in the network is expected to have little influence on the 
convergence speed of the designed algorithms because at each link, the forward link interference- 
plus-noise covariance matrix and its reverse link dual summarize all interference no matter how 
many neighbors of the link are there. The convergence speed is expected to depend on the 
girth of the network because each iteration propagates the influence of the transmission scheme 
of a transmitter to its closest neighbors. But in a wireless network, large girth with uniformly 
strong channel gains has small probability to happen. In addition, the optimal polite water- 
filling structure is forced on to the solution at each iteration. Therefore, the number of iterations 
needed is small. In simulations results, we observe that 1 .5 iterations achieves much of the rate 
and 2.5 iterations achieves most of the rate of 50 user multiaccess channels and two or three 
user interference channels, where half an iteration refers to an update in the forward link or 
reverse link. Define the accuracy as the difference between the performance of a certain number 
of iterations and the maximum, if known. Accuracy versus iteration numbers gives a closer look 
at the asymptotic convergence behavior. As expected, the algorithms in this paper have superior 
accuracy because of the optimality of the polite water-filling structure. 

In summary, Algorithm PT and PP are expected and have been verified by simulation to have 
superior performance, complexity per iteration, convergence speed, and accuracy than other 
algorithms that do not take the advantage of the optimal input structure. 

V. Simulation Results 

In this section, simulations are used to verify the performance of the proposed algorithms. 
Block fading channel is assumed and the channel matrices are independently generated by 
H^fc = ^/gi^H-i^, \fk, I, where has zero-mean i.i.d. Gaussian entries with unit variance 
and gi k ,\fkj is a constant. In most cases, we let g i k = OdB,Vfc,/ except for Fig. [10] and Fig. 
ITTl For weighted sum-rate, the weights are randomly chosen between 0.8 and 1.2. In Fig. [3}|5l Fig. 
[TOl and Fig. QT1 each simulation is averaged over 100 randomly generated channel realizations 
to show that the performance difference is not an accident. In all other figures, the simulation 
is performed over a single channel realization to show the details of the asymptotic accuracy 
or the effect of different initial points. In Fig. [9[ Fig. [TU and Fig. [121 where we run Algorithm 
PT or PP with several initial points, the first initial point is chosen to be zero and other initial 
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Figure 3. Convergence speed comparison of the sum-rate for a 10-user MAC. 
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Figure 4. Convergence speed comparison of the sum-rate for a 50-user MAC. 
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Weighted sum rate for 10 users 




Number of effective iterations 



Figure 5. Convergence speed comparison of the weighted sum-rate for a 10/50-user MAC. 
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Figure 6. Sum-rate accuracy of the algorithms for a 50-user MAC. 
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Figure 7. Weighted sum-rate accuracy of the algorithms for a 50-user MAC. 
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Figure 8. Convergence speed of the algorithms for a Z channel and an interference channel. 



points are randomly generated. In all other cases, we use zero initial points for all algorithms. 

To make fair comparison of the convergence speed by taking into account the complexity, 
we use effective iterations. Each iteration of the Algorithm PT or PP consists of an update in 
the forward links and an update in the reverse links. The rates are output at 0.5, 1.5, 2.5, etc., 
effective iterations. For Algorithm PI, the i th iteration is also the i th effective iteration because 
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Figure 9. Convergence speed of Algorithm PP with different initial points for a 2-user X channel. 
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Figure 10. Convergence speed comparison with selfish water-filling for the iTree network in Fig. [2] 



each iteration consists of the update in the reverse and then the update in the forward links. 
For the other algorithms that we compared with in the literature, the i th iteration is counted 
as the («/2) th effective iteration because they only have update in the forward links. This is a 
conservative approach because some of these other algorithms' complexity of each iteration is 
higher than that of one half iteration of Algorithm PT or PP. Nevertheless, Algorithms PT and 
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Figure 11. Convergence behavior comparison with selfish water-filling for a 3-user interference channel. 



10 




Rate of user 1 (bit/channel use) 



Figure 12. Achieved rate regions of a two-user interference channel. 



PP still show clear advantages. 

We first demonstrate the superior convergence speed of Algorithms PI, PT and PP Fig. [3]|5] 
plot the (weighted) sum-rate versus effective iteration number for a MIMO MAC with 2 transmit 
antennas at each user and 8 receive antennas. The scenarios of sum-rate, weighted sum-rate, 10 
users, and 50 users are simulated. The results are compared with those of the steepest ascent 
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algorithm in EOl with Matlab code from ||69l , the 'Original Algorithm' and 'Algorithm 2' of 
JRVG in with Matlab code from Il70l . and the dual decomposition algorithm in ifTOl . The 
'Original Algorithm' in [0 is obtained by enforcing the water-filling structure for the sum-rate in 
|[8l at each iteration. It is observed in (9] that the sum-rate of the 'Original Algorithm' grows fast 
initially and then may decrease and/or oscillate. Therefore, a hybrid algorithm called 'Original 
+ Algorithm 2' is considered the best in 0, where the 'Original Algorithm' is performed for 
the first five iterations, and then Algorithm 2' in [0 is used for all the subsequent iterations. 
It is observed that Algorithms PI, PT and PP have similar initial convergence speed as that of 
'Original + Algorithm 2' and outperform all other algorithms. As expected, the convergence 
speed of Algorithms PI, PT and PP appears to be independent of the number of users because 
the girth of the interference graph of the MAC is one, regardless of the number of users. In 
Fig. [6] and Fig. H we compare the asymptotic convergence speed, or accuracy, defined as the 
errors from the maximums of the sum-rate and the weighted sum-rate respectively. Algorithm 
PI, PT and PP have the highest accuracy than other algorithms except for the dual decomposition 
algorithm in lfT0ll . This is not surprising because the dual decomposition algorithm is a water- 
filling directly on the objective function, sum-rate, while the polite water-filling is on each link. 
The errors accumulate when summing all the links' rates together. In summary, for sum-rate, the 
best algorithm will be 'Original Algorithm + Algorithm PT or PP' for a compromise of initial 
convergence speed and accuracy. For weighted sum-rate, the best is Algorithm PT or PP. 

In Fig. [H we plot the sum-rate for a Z channel with concave sum-rate function and the 
weighted sum-rate for a 3-user interference channel. In both channels, each node has 4 antennas. 
For the Z channel, it can be observed that Algorithms PI and PT have faster convergence speed 
than Algorithm P. For the 3-user interference channel where the problem may be non-convex, 
Algorithm PT and PP have the same performance and converge fast and monotonically to a 
stationary point. 

In Fig. we show the convergence speed of Algorithm PP for a nontrivial example of B- 
MAC networks, the two-user X channel, with 3 antennas at each transmitter and 3 antennas at 
each receiver. We plot the weighted sum-rate versus the effective iteration number with three 
different initial points for the same channel realization. For all initial points, Algorithm PP 
converges quickly to some stationary points. However, the convergence speed is observed to 
depend on the initial point. And due to the non-convexity of the problem, Algorithm PP with 
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different initial points may converge to different stationary points. 

As expected, the polite water- filling is superior to the selfish water- filling. The selfish water- 
filling can be obtained by fixing the fVs in Algorithm PT or PP as identity matrices. Selfish 
water-filling is expected to converge in iTree networks but not in networks with strong inter- 
ference loops as shown in Fig. \\0\ for the iTree network of Fig. [2] and in Fig. [TT] for a 3-user 
interference channel. Each node is assumed to have four antennas. In the upper sub-plot of Fig. 
ITOl we consider the moderate interference case, where we set gi t k = OdB, \/k,l. In the lower 
sub-plot of Fig.QjJ we consider strong interference case, where we set g i 3 = lOdB, I — 1,2 for 
the interfering links, and = OdB for other k, Vs. It can be observed that the three algorithms 
converge in both cases. But Algorithm PT and PP achieve a much higher sum-rate than the 
selfish water-filling. In the upper sub-plot of Fig. [TH we set gi jk = OdB, Vfc, /, and similar results 
as in Fig. [10] can be observed. In the lower sub-plot, we consider a strong interference channel, 
where we set = 10dB,VA; ^ I, and gi^ = OdB,VA; = I. In this case, the selfish water-filling 
based algorithm no longer converges. Algorithm PT converges with a good initial point found by 
a typically three trials. But even with a non-converging initial point, Algorithm PT still achieves 
a higher sum-rate than the selfish water-filling. Algorithm PP converges for almost all channel 
realizations, which demonstrates its insensitivity to the interference loops and strength. 

Algorithms PT and PP can be used to find the approximate boundary of the convex hull of 
the achievable rate region of MIMO B-MAC networks. In Fig. [J_2l we plot the approximate 
boundary achieved by Algorithm PP for a two-user interference channel with 3 antennas at each 
transmitter and 4 antennas at each receiver. The sum power constraint is P T = lOdB. The weights 
are w\ = \i and u>2 = 1 — fJ>, with \i varying from 0.01 to 0.99 with step 0.01. The boundary (dot 
line) obtained from a single initial point for Algorithm PP is close to the pseudo global optimum 
(solid line), which is the best solution of Algorithm PT with many randomly generated initial 
points. The result demonstrates that Algorithm PP can find good solutions even with single initial 
point. Not showing is that Algorithm PT with a single initial point also achieves near pseudo 
optimum boundary. 

VI. Conclusions and Future Work 

This paper extends BC-MAC rate duality to the MIMO one-hop interference networks named 
B-MAC networks with Gaussian input and any valid coupling matrix. The main contribution is 
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the discovery that the Pareto optimal input has a polite water-filling structure, which optimally 
balances between reducing interference to others and maximizing a link's own rate. It provides 
insight into understanding interference in networks and an elegant method to decompose a 
network to multiple equivalent single-user links. It can be employed to design/improve most net- 
work optimization algorithms. As an example, low complexity weighted sum-rate maximization 
algorithms are designed and demonstrated to have superior convergence speed and accuracy. A 
sub-class of the B-MAC networks, the interference tree (iTree) networks, is identified, for which 
the optimality of the algorithm is discussed. iTree networks appears to be a natural extension of 
broadcast and multiaccess networks and possesses many desirable properties. 

The results in this paper is a stepping stone for solving many interesting problems. Some 
future work is listed below. 

• Extension to Han-Kobayashi Transmission Scheme: Han-Kobayashi scheme cancels more 
interference and is especially beneficial when the interfering channel gain is large. But 
its optimization is still an open problem. As discussed in Remark [7] of Section IIII-Bl the 
Lagrangian interpretation of the reverse link interference-plus-noise covariance matrix for 
the polite water-filling makes it possible to extend the duality and polite water-filling to 
Han-Kobayashi transmission scheme so as to help understand the optimal input structure 
and design low complexity optimization algorithms. The approach in this paper may also 
be useful in multi-hop networks, relay channels, and ad-hoc networks. 

• General Linear Constraints, Cognitive Radio, and Other Optimization Problems: Polite 
water- filling is beneficial to most B-MAC network optimization problems. For example, the 
extension to a linear constraint in Section IIII-CI means that the duality and polite water- 
filling can be powerful tools to optimize networks with general linear constraints, such 
as per antenna power constraints and interference constraints in cognitive radio. In these 
problems, the single linear constraint in this paper becomes a weighted sum of multiple 
linear constraints where the weights are Lagrange multipliers. Searching these Lagrange 
multipliers together with the primary variables is an advantage of the efficient interior point 
algorithms in ll56ll . [|57l . The techniques in this paper can lead to better iterative algorithms 
by taking advantage of the structure of the problems: Polite water-filling can solve the 
problem efficiently to obtain Q;'s for fixed Lagrange multipliers, where the covariances 
fVs are functions of the Lagrange multipliers. Then, at each iteration and for fixed Q;'s, 
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we may update the Lagrange multipliers by forcing = fl l ' Q;S~2 ; ' to satisfy the 
linear constraints so that both the optimal primary and dual variables can be found jointly. 
A better approach is to introduce virtual nodes for the constraints and take advantage of 
the polite water-filling and duality for the larger network where those Lagrange multipliers 
become some of the reverse link powers. Other optimization problem examples include 
ones with quality of service requirement, such as minimizing power under rate constraints 
and maximizing the minimum of weighted rates under a sum power constraint in our recent 
work ||40t 

• Distributed Optimization and Finite Rate Message Passing: This paper studies centralized 
optimization with global channel state information and a sum power constraint. In practice 
and for large networks, we have to design distributed/game-theoretic optimization algorithms 
with partial channel state information under individual or group power constraints [|39l . Il40ll . 
[71]. The insight from polite water-filling structure turns out to be very useful as it turns the 
problem into single-user optimization problems under the influence of interference from and 
to others as summarized by the covariance matrices Qi and ft[. Partial or full knowledge of 
these covariance matrices can be obtained from reverse link transmission or pilot training in 
time division duplex (TDD) systems or from message passing among the nodes in frequency 
division duplex (FDD) systems ll72ll . The observation that very few iterations, usually two 
to three, suffices for the algorithms to achieve most of the gain makes the message passing 
approach meaningful. In practice, the message passing is further limited to finite rate. The 
single-user view of the polite water-filling structure makes it convenient to extend the results 
of the single-user finite rate feedback in f73l to B-MAC networks. 

• Extension to Fading Channels, OFDM Systems, and Adaptive Transmission: Fixed channel 
matrices are considered in this paper. It is straightforward to extend the results to time 
varying channels like fading channels and OFDM system by considering parallel B-MAC 
networks. Then, the polite water-filling will be across users, space (antennas), time, and 
frequency. The optimization of encoding/decoding order in general is an open problem. 
Progress may be first made for the iTree networks. In time varying channels, the insight 
of viewing reverse link power as Lagrange multipliers can be employed to design adaptive 
transmission algorithms where the Lagrange multipliers are adaptively adjusted. In fact, the 
iterative algorithms in this paper can be used with little modification by, e.g, updating to 
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the latest channel matrices whenever they are available. 
• iTree Networks: A fruitful approach to the open problem of network information theory 
is to study various special cases, such as deterministic channels IfTTTl . ||6TTl and degree of 
freedom (5]|, IfTTTl . Il62ll , in order to gain insight into the problem. iTree networks appears 
to be a useful sub-class of networks to study. Because there is no interference loops, the 
optimization of a link will affect some other nodes, but which in turn will not change the 
interference to this link. Therefore, it is possible to say more about the optimality of the 
transmission and make more progress on network information theory for iTree networks. 
The tree structure also makes it easier to analyze the distributed optimization by iterative 
message passing ll39ll . just like how the tree structure in LDPC decoding graph facilitates its 
analysis |[74j|. In fact, a factor graph interpretation of the polite water- filling may reveal some 
more insight. Because of mutual interference in wireless channels, there are likely many 
loops in the interference graph defined in Section ITV-B I Interference cancellation techniques 
like the Han-Kobayashi scheme [0 can be viewed as a method to break the strong loops 
and make the graph more like a tree, making iTree networks relevant to practice and more 
meaningful to study. 

Appendix 

A. Proof of Theorem [7] 

Assume r is a boundary point and S 1L achieves a rate point r' > r, and 3k, s.t. r k > 
r k . One can find an < a < 1 such that X\ (CS'i, aS^, S^) , <&) > r\, V/. Then, the 
extra power Tr (E^) — Tr (aSj can be used to improve all non-zero rate users' rates over 
r using (3 CS\, ...,aYl' k , H'A as the input covariance matrices, where /3 > 1 is chosen such 
that X^ fe Tr(/3£' z ) + Tr (/3aS' fc ) = P T . By the definition that the region is a union of sets 
{x : < x\ < Xi (£i : l, 3>) , 1 < / < L}, r must not be a boundary point, which contradicts with 
the assumption. Therefore, the statement in Theorem [TJ must be true. 

B. Proof of Theorem |?] 

Without loss of generality, we only prove Theorem |4] for link 1. We assume Mi — M,l — 
1,...,L for simplicity of notations. The proof is obtained by solving the KKT conditions of 
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another sum power minimization problem as will be formulated later in (1591 ), where the mini- 
mization is over Si and the transmit powers of other links p_i = [p|\ ...,p^] T £ j^C L ~ 1 ) Mxl 
with the transmit and receive vectors for other links fixed as {t fc m , r k>m , k ^ 1}. We first give 
Lemma [TT] which states that there exist dual variables satisfying the KKT conditions of problem 
<|59~1 ). Then we show that the dual variables in Lemma [TT] are the optimal dual variables of 
problem (1221) and they must be uniquely given by Theorem HJ Finally, we prove Lemma [TT] by 
the enhanced Fritz John necessary conditions 11751 Sec. 5.2]. 

First we define the notations used in the proof. Define four sub matrices of the cross-talk 
matrix * = * ( T, R ] in (O) as 



*i £ 



)MxM 



p Afx(i-l)Af 



where T 



* i,i € 

and R 



(L-l)MxM 



i G-+ 

;(i-l)Mx(L-l)JVf 



*1 

*_i £ 

[pl,m\ m =l,...,M,l=l,...,L 

and receive vectors. Let T>i £ M.+ xM and D_i £ -fig(L-i)Mx(L-i)M respectively be the sub 
matrices at the upper left and lower right corner of D ( T, R, 7 ) in (fTTl) . where 7 is set as the 



are the Pareto optimal transmit 



SINR values achieved by |t,R, pi, i.e., 7; „ 

b_i = d:1-*_!. 



li mi I < rn < M, 1 < I < L. Define 



For k — 2, L, let £ C^^- 1 )^ be the ((k - 2)M + m) th row of B_l 



Then we formulate another optimization problem in (1591) and show that the dual variables 
satisfying the KKT conditions of problem (l59l) is also the optimal dual variables of problem 
(1221) . Suppose for other links, the transmit and receive vectors are fixed as T and R respectively. 
In problem (122)) . the power allocation for other links is also fixed as p_i = [pf , p£] T . Now 
consider the following sum power minimization problem where the power allocation for other 
links is not fixed. 



s.t. 



min Tr(Si) + l T p_i 

Sih0,p-i>0 



(59) 



log 



I + H^SxH^^ 1 (p_! 



>Xi 



, k = 2,...,L 

bk,m -1 
-1 P-l < "I. 



m 



1,...,M 



(60) 
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where 



M 



^1 (P-l) = I + ^ *1.* H M Pfc.mtfc,mtfc im Hj 



k i 



fc=2 



m=l 



is the interference-plus-noise covariance matrix of link 1 . Note that the constraints in (l60l) imply 
the individual rate constraints: ]Cm=i lo S i 1 + 7fc,m) > Zk = Em=i lo S (1 + %,m) , A; = 2, L, 
where 



7*v 



L M 



(61) 



1 + Tr ( EiA (1) 



fc,m 



Z=2 n=l 



is the SINR for the m th stream of link k achieved by S ls p_ ls and {t fc m , r fcjm , 7^ 1}. This is 
because from the definition of b fc '"\ the constraint Tr ^SiA^M — b^'" l p_! < — 1 is equivalent 
to jk >m > % >m . It can be proved by contradiction that the Pareto optimal input Si and p_i is 
an optimal solution of ( |59l ). The Lagrangian of problem (1591) is given by 

L(A, !/!, 0, Si, p_0 = Tr (Si (A x (A) - 0)) + v£ x 



- filog 



1 + Hi.xExH^nr 1 (P-i 



+ 1 Vi 



A T 1, 



(62) 



,(L-l)Mxl 



are 



where the dual variables v\ G R+ and A = [A 2 ,i, A 2 ,m, ^l,m] £ 

associated with the rate constraint and the constraints in ( |60l ) respectively; Ai (A) is defined in 
(1241) ; © is the matrix dual variables associated with the positive semidefiniteness constraint on 
Si. Because this problem may not be convex, later, we will prove the following lemma which 
states that there exist dual variables satisfying the KKT conditions of problem (|59|) . 

Lemma 11: The KKT conditions are necessary for S 1; p_i to be optimal for problem (|59l) . 



i.e., there exist dual variables A 
such that 



)(i-l)Mxl ~ 



v x > and y 0, Tr Si0 



0. 



Si=Si 



0. 



p-i=p-i 



V Sl L^ V Sl L(A, u u G, Si, p_i; 

Note that fl 1 (p_i) = f2i. Then (1631) can be expressed as 

Ai (a) - f/iH^ (f2i + Hi^SiH^) 1 H11 -0 = 0. 



= 

(63) 
(64) 

(65) 
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Because (1651) is also the same as the equation that the derivative of Lagrangian of the problem 
(|22|) equals to zero, and S x is the optimal solution of problem (T22l) . A, z> 1; ©, S x also satisfy the 
KKT conditions of problem (1221) . Since problem (1221) is convex, A, 2/1, are the optimal dual 
variables of problem (1221) . 

We show below that A and V\ in Lemma [TT] is uniquely given by (l26l) and (T27T) . Combining 
this fact and Lemma [EH we conclude that A and v\ given by d26l) and (ITTT) are the optimal 
dual variables of problem (|22j) . This will complete the proof for Theorem HI It is difficult to 
directly solve A and v\ from (1631) and (l64l) . To simplify the problem, without loss of generality, 
we restrict Ei to be Ei = Tidiag (pi) T\ in the Lagrangian. For convenience, define 

M 

r{ (p) = lo § (i + 7i,m (T, R, p) ) , (66) 



where 7i m ^T, R, pj is the SINR of the m th stream of link 1 achieved by |t, R, p| as in (Q~3 



Note that log 



I + Hi^EiH^f^ 1 (p) = rf (p), and t f l m et lim = because Tr (^l©) = 
and is positive semidefinite. Then the Lagrangian (1621) can be rewritten as a function of A, v\ 
and p = [pi , p^x] T as follow. 



L(A,i/i,p) = EjLiPi.mti^Aj ^A) t 1>m + l T p_ 
+z>x (j x - (p)) - A T B_ lP _a + A T 1. 



Note that ([63]) and d64]) imply 

V P1 Z (A^ijp^p^]^ 
V P _ 1 I (^A,i>i, [pf,p T !] 



= 0, (67) 

pi=pi 



= 0. (68) 

p-i=p-i 



As will be shown below, the solution of the above equations is uniquely given by (l26l) and dTTl) 
Therefore, this unique solution must also be the solution of the more complex equations in (1631 
and (1641). 



For convenience, define qi = [qi,i, and 



Pl,m (1 + 71 
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where G\ 



f { m Hi i iti >m . It can be derived from (1661) and ^ 1>m = 7i jm IT,R, p) that 



dr{ (p) 

dpi sm 

„ dr{ (p) 

dp k ,m 



P=P 



P=P 



7l,m 
ili 



n=l 



^*^gi,n,m = l,...,M,fc^l. 



n=l 



L M 



Note that m Ai ^A J ti jm = ^^A^Sf^'™ + 1 and recall the definitions of the sub matrices 

fc=2n=l 

of ^ and D ( T, R, 7 ) . Then equation (1671) and (1681) can be expressed as 



B^A - *J_ iqi = 1, 



(69) 
(70) 



r ~ 1 T 

respectively. Define q = qf , X T . Then (l69l) and (l70l) together form the following linear 
equations 

(D- 1 (f ,R,7°) - * T (T,R)) q = 1. 
Since D 1 ^T, R, 7 ) - * T ^T, R) is invertible [50], q is uniquely given by 

q = (D- 1 (f , R, 7 ) - * r (t, R) ) ' 1 = q, 

which means A, v\ is uniquely given by (1261) and (1271) . 

The rest is to prove Lemma ITT1 First, we give some notations and definitions. Let Vec (M) 
be the column vector obtained by stacking the columns of the matrix M on top of each other. 
Define a convex set of (L^ + (L — 1) M) -dimensional complex vectors 



X 



Vec^f,?^ 



T 



■ Si y o,p_i > } . 



(71) 



We also define the complex normal cone of a convex set as a natural generalization of the real 
one defined in 11751 Sec. 5.2]. 

Definition 9: Denote X a convex set of n-dimensional complex vectors. For x £ X, 

N x (x) = {z £ C n : Re (V (x — x)] < 0, Vx £ X} , 

is called the complex normal cone of X at x. 

Then problem (l59l) can be rewritten as an optimization problem over the convex set X, where 
the cost function and constraint functions are real functions of a complex vector x £ X and 
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an optimal solution is given by x 



~T 

pii 



. The proof is based on the enhanced 



Vec (£1 

Fritz John necessary conditions in Il75l Sec. 5.2], where they are expressed in real number. 
Because a real function of a complex vector x is equivalent to a real function of a real vector 
[Re(x) r , Im(x) T ] T , for convenience, we will later rewrite the enhanced Fritz John necessary 
conditions in 11751 Sec. 5.2] in complex form using the definition of the complex gradient of a 
real function in Section HH Define a function related to the Lagrangian function in (l62l) 

J(rj, A,z/i,x) = J(r/, A,i/i,Si,p_i) 

=Tr (EiA (A, 77)) + r/l T P-i - A T B_i P _i + A T 1 



where x 



Vec^f^i 



T 



log I + HySiH^fi^p^) 

; and A (A, 77) = J2k=2 J2m=i A fc,™ A i!L + V 1 - Because the cost 
function and constraint functions in problem (l59l) are smooth over x G X and X is a nonempty 
closed set, the optimal solution x must satisfy the enhanced Fritz John necessary conditions 11751 
Sec. 5.2]: there exist fj, A, T>\ such that 

-w G N x (x) , (72) 

V, A, V\ > and not all 0, (73) 

where w = V x J(fj, A, Pi, x), and iV^r (x) is the complex normal cone of ^ at x. It can be 
shown that (x — x) , Vx G is real by the definitions of w and X. Then by Definition [9l 
the condition (1721 can be expressed as 

(x — x) > 0, Vx G X. (74) 

We first show that if f\ > 0, the above enhanced Fritz John conditions are equivalent to the 
KKT conditions of problem (1591 ). Then we show that fj = is impossible. This concludes that 
Si,p_i must satisfy the KKT conditions in (1631) and (l64l) . For convenience, define 



V Sl J = V Sl J(t7, A, i/i,Si,p_i), 
V P _ X J = V p _ 1 J(r7, A, p!,Si,p_i). 



Then w = V X J| 3 



p_i^7 



For x 



Vec(^V Sl Jjj ,(V 
Af, where Q ^ can be any positive semidefinite matrix, by (1741) . we have 

w^x-x) = Tr (QV Sl j) > 0. 



Vec Si + Q 



T 



(75) 
(76) 

T 
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In addition to that Vj^J is Hermitian, it implies that Vs t J is a positive semidefinite matrix. 
For x = [0 T , p^x] 7 , by (|74|) . we have 

w f (x - x) = -Tr fSiVs! A > 0. 

Since both Vj^ J and Si are positive semidefinite, we must have 

Tr(SiV Sl j) =0. 

r t i T 

For x = Vec (J^i) , P^i > (T74l) becomes 

(v p _^) T (p-i-p-i)>o. 

Since p_i > and p_i can be any nonnegative vector, we must have 



V p _ 1 J = 0. 



(77) 



If fj > 0, we can make A = \/fj > 0,Pi = vjfj > 0, = V Sl J/r7 h 0. Then, the KKT 
conditions in Lemma QT] are satisfied by observing 

V Sl L = V Sl J/77- = 0, 

Tr (Ei0) = Tr (SiV Sl J/77) = 0. 

The rest is to prove that 77 = is impossible by contradiction. Suppose fj = 0. Define a 
function 

J (A,^i,Si,p_i) = J(fj, A, Pi, Si, p_i) - Tr ^SiV Sl Jj . 

The it follows from 475) and dTTJ) that 

V Sl J (A,Pi,Si,p_i) = 0, (78) 

Vp_ 1 J (A,z> 1 ,S 1 ,p„ 1 ) = 0. (79) 

Note that t[ m V Sl Jti im = because Tr(SiV Sl j) = 0. If we restrict Si to be Si = 
T]diag (pi) Ti, then log I + HiiSiHj ifl^ 1 (p) = rf (p) and J can be rewritten as a function 
of Pi,p 1 



M 



J (A, Vi, pi, p_i) = ^ Pl,mtl,m A (X °) tl.m + A T 1 



m=l 



Pi (Zi-r; (p) -A T B_ip_i. 
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Then (78]) and (|79]) imply 



V pl J (A,z7 1 ,p 1 ,p_ 1 ) = 0, (80) 
Vp-MK^puP-i) = 0. (81) 



Let qi = [q 1;1 , ...,q 1>M ], where <7i, m ,Vl <m< M is given by 

9l,m 



^l7l 2 m 



Pl,m (1 + 71 

Define q = [qf, A T ] T . Then following similar steps as those solving (1671) and (1681) . it can be 
shown that (|80l ) and ( f8Tb form together the following linear equations 

(D- 1 (f , R, 7°) - * T (f , R) ) q = 0. 

Because D 1 (t, R,7°) - * T (t,RJ is invertible [50], q = must hold, which implies 
A = 0, V\ = 0. This contradicts with the condition (1731) that fj, A, V\ can not be all zeros. 
Therefore, we must have fj > 0. 

C. Proof of Lemma \3\ 

It is a consequence of the SINR duality applied to a single-user channel H; = fi^^H^f^" 1 



is 



Decompose Q; and Q z to beams as Qi = J2m=i d i,m^i >m ^l m , where di >m = pi 
the equivalent transmit power and u i)m = Cl 1 / 2 ^/pj^ti >m / y/d^m is the equivalent transmit vector; 



r»i/2 + 

s£ 7 W,m 



and Q, = ]Cmli d hm v hm w] m , where d, >m = <^ m fi^r^ and v Zjm = ^/qj^ri >m / yj d 
According to the covariance transformation, {t i m }, {r / m }, {p^ m } and {gz, m } achieves the same 
set of SINRs in the forward and reverse links respectively. Correspondingly, {u/ >m }, {v ifn }, 
{di iJn \ and achieves the same set of SINRs in the single-user channel H/ and Sj respec- 

tively. Apply Theorem 1 of ll50l to this single-user network, we obtain J2m=i di, m — Sm=i 
i.e.,Tr(Q 1 ) = Tr(Q I ). 

D. Proof of Theorem [7] 

The proof contains three parts. The uniqueness of the covariance transformation is shown 
by the idea that the covariance transformation for a network can be equivalently viewed as 
the covariance transformations for parallel single-user channels, where the capacity achieving 



I 1 / 2 ! 



,1/2 



H.m- 
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input covariance matrix for each single-user channel is unique. The other two parts, the matrix 
equations (l3TT) and (|32|) . are proved by utilizing the polite water- filling structure. 

First we prove the uniqueness of the covariance transformation by showing that different 
decompositions of S 1:i produce the same through the covariance transformation. We will 
use the notations as in the proof of Lemma [3l Suppose is transformed to S 1:£ using the 
decomposition = Y^^iPi,rnXi,rnXi m ,l = 1,---,L with {r Zm } as the corresponding MMSE- 
SIC receive vectors and {qim} as the transmit powers in the reverse links. The corresponding 
equivalent input covariance matrices are Q, = J2mU ^«,mU/, m u| m and Q ? = J2mU ^.mVi.mvJ^. 
For a new decomposition £/ = Y^m=iPim^'im i^iw) ■> ^ = 1> •••)£> we can change from the 
old decomposition to the new decomposition one link at a time. Without loss of generality, let I 
be the first link to be changed and prove remains the same. In this case, the transmission 
schemes of all other links are still {t^m, pk, m , 7^ /}, and the corresponding MMSE-SIC receive 
vectors are still {rfc jm ,V/c 7^ 1} because the interference at link I) does not depend on the 
decomposition of S ; . Furthermore, we artificially fix the transmit powers of all other reverse links 
as {qk, m ,yk 7^ /}. Then both fli and Cli are not changed. Find T,[ by calculating the MMSE-SIC 
receive vectors (r) m } and the unique q[ m such that the SINR of the m th stream of the I th forward 
and reverse link are the same, Vm. Since Qi is a water-filling solution over the forward equivalent 
channel and thus achieves the capacity of ll67l . the corresponding Q) = fl^Xjfij 
achieves the capacity of the equivalent reverse channel fij with power Tr [Qij — Tr(Q^) by 
Lemma [3] applied to a single-user network of link /. Therefore, Q[ = Qi because Q; also achieves 
the capacity of the fij with the same power Tr (Qi^J — Tr(Q^) and the capacity achieving 
covariance matrix is unique |[67l . It follows that = Sj, which implies the interference from 
reverse link / to other reverse links does not change. Then, {r^ m , q' lm } and {r fc m , qk >m , VA; 7^ /} 
achieves the same SINRs as the forward links and thus = S^S/^j is the result of the 
covariance transformation and is invariant with the decomposition of 

Second, we prove (I3TT) . In the rest of the proof, we will only use the terms corresponding to 
link I. The subscript I will be omitted for simplicity. We will use the notations defined in Theorem 
El Noting that GAF 1 = fHHtfH, FAG 1 " = fWHfW, GA 2 G t = Q ^WQ x Un K 
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F+F = I and G^G = I, we have 

ADA = DA 2 
fHGAF^DF^AG* 

= 0"^GDG t GA 2 G t 

= a ^GDG f n ^Hta^Hn^ 

where the last equation follows from (l29l) and (l30l) in Theorem [5] 

Finally, we prove (l32l) . Expand F and G to full unitary matrices F = [FF] G C LrxLr and 
G = [GG] G C LtxLt . We also zero pad A and D to A G C LrxLt and D G C LtxLt . Then 
we have 



v (or 1 - (HSH f + n) v 
--uQ- 1 / 2 (i- (HQHt + 1)" 1 ) Q,- 1 ' 2 

uQ,- l/2 (l - (^FADA r F t + i) J rr 1/2 (82) 

--un-^f - (ada t + i)^ f+n- 1 / 2 

n-^FDF+n -1 / 2 = £ (83) 



where the Equation (l82l) follows from Q = GDG^ and the Equation (f8~3l) follows from D 
(r/I - A~ 2 ) + and Q = FDFt. 



May 4, 2010 



DRAFT 



SUBMITTED TO IEEE TRANSACTIONS ON INFORMATION THEORY, APR. 2010 



65 



E. Proof of Theorem [70| 

The weighted sum-rate function for a two-user Z channel can be expressed as 

/(Si,E 2 ) 



(84) 



=Wilog 



.i^i^i i + h 1i2 s 2 h| 2 



wi (log 



log 



I + H lt2 E 2 Ht j2 |) 



I + H^^^^H^ 2 



I + H 2;2 S 2 H 2i2 
+ (w 2 - Wi)log 

Because the sum of concave functions is still concave, and the first and third terms in (|84l) are 
logdet functions and thus are concave, we only need to show that the second term 



I + H^^^^H^ 2 



log 



I + Hi^S^Hj 2 



g (E 2 ) = log 

is a concave function. 

Consider the convex combination of two different inputs, X 2 >z and Z 2 >z 0, 

S 2 = £Z 2 + (l-t)X 2 

= X 2 + tY 2 , 

where Y 2 = Z 2 — X 2 is Hermitian and < t < 1. Then g (S 2 ) is a concave function if and 
only if (£ 2 ) < for any X 2 >: 0, Z 2 b and < t < 1 ll66l 



It can be derived that [l30ll 

^(E 2 ) = — Tr [AY 2 AY 2 ] + Tr (BY 2 BY 2 ) , 

where A = ^(H2 2 H 2)2 ^ + and B = f 2 Hi j2 j + are positive definite 

matrices. If H 2 2 H 2 2 y Hj 2 H lj2 , it can be proved that A y B. Then 

Tr (AY 2 AY 2 ) 
= Tr (A _1 / 2 Y 2 AY 2 A~ 1 / 2 ) 

> Tr (A-Vsy.BY.A^ 1 / 2 ) 
= Tr (B _1 / 2 Y 2 AY 2 B~ 1 / 2 ) 

> Tr (B- 1 / 2 Y 2 BY 2 B- 1 /2) 

Tr(BY 2 BY 2 ), 

which implies that 4Lg (£ 2 ) < and g (E 2 ) is a concave function. This completes the proof. 
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F. Proof of Lemma [70| 

Without loss of generality, we consider an iTree network where the I th link is not interfered by 
the first I — 1 links. Because problem (1371) is convex, we only need to show that the conditions 
in Lemma [TO] are necessary and sufficient to satisfy the KKT conditions of problem (1371) given 
in (l55l) . The sufficient part is proved in Theorem [13] for general B-MAC networks. We only 
need to prove the necessary part, i.e., if Ei : £ satisfy the KKT conditions with the optimal dual 
variable jl, it must satisfy the conditions in Lemma [TOl Due to the interference structure of iTree 
networks, we have <f>k,i = for k > I, and thus the condition Vs,L| s _ = % _ = can be solved 
one by one from I = 1 to I — L. For I = 1, Ve,L| s fj = can be expressed as 

Cl, = ^-H.\ l (fix + H^SxFLl ,) ~* Hx,! + 40!, (85) 

where Cl\ = I. Because Equation (1851) is also the KKT condition of the single-user polite water- 
filling problem, the solution Si is unique and must satisfy the polite water-filling structure with 
water- filling level w\j\x as in Lemma [TOl By Theorem [71 we have 



Then for / = 2, Ve,L| s ^ = can be expressed as 

^ 2 = ^H 2 2 fn 2 + H2,2S 2 H2 2 ) _1 H 2 . 2 + 4e 2 , (86) 



where 



&2 = I + H\ 2^1111,2 



I + Ht )2 ^ (^fi^ 1 - (fix + Hx^SxH^) ^ Hx, 



Similarly, the solution S 2 must satisfy the polite water-filling structure with water-filling level 
wijfi as in Lemma [101 Following similar proof as above, it can be shown that for I = 3, L, 
must satisfy the conditions in Lemma [TOl This completes the proof. 
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